Abstract
A number of artificial intelligence and machine learning problems need to be formulated within a directional space, where classical Euclidean geometry does not apply or needs to be readjusted to the circle. This is typical, for example, in computational linguistics and natural language processing, where language models based on Bag-of-Words, Vector Space, or Word Embedding representations are widely used for tasks such as document classification, information retrieval, and recommendation systems. In these contexts, assessing document clustering and outlier detection applications often requires generating data with directional properties, whose units follow some model assumptions and possibly form close groups. We propose a Reduced Variable Neighbourhood Search heuristic that generates high-dimensional data controlled by the desired properties, with the aim of representing several real-world contexts. The whole problem is formulated as a non-linear continuous optimization problem, and the proposed Reduced Variable Neighbourhood Search is shown to generate high-dimensional solutions to it in short computational time. A comparison with the state-of-the-art local search routine used to address this problem shows the greater efficiency of the approach presented here.
Publisher
Springer International Publishing