An efficient and scalable search engine for models

Author:

López José Antonio Hernández,Cuadrado Jesús Sánchez

Abstract

AbstractSearch engines extract data from relevant sources and make them available to users via queries. A search engine typically crawls the web to gather data, analyses and indexes it and provides some query mechanism to obtain ranked results. There exist search engines for websites, images, code, etc., but the specific properties required to build a search engine for models have not been explored much. In the previous work, we presented MAR, a search engine for models which has been designed to support a query-by-example mechanism with fast response times and improved precision over simple text search engines. The goal of MAR is to assist developers in the task of finding relevant models. In this paper, we report new developments of MAR which are aimed at making it a useful and stable resource for the community. We present the crawling and analysis architecture with which we have processed about 600,000 models. The indexing process is now incremental and a new index for keyword-based search has been added. We have also added a web user interface intended to facilitate writing queries and exploring the results. Finally, we have evaluated the indexing times, the response time and search precision using different configurations. MAR has currently indexed over 500,000 valid models of different kinds, including Ecore meta-models, BPMN diagrams, UML models and Petri nets. MAR is available at http://mar-search.org.

Funder

Universidad de Murcia

Ministerio de Educación, Cultura y Deporte

Publisher

Springer Science and Business Media LLC

Subject

Modeling and Simulation,Software

Reference59 articles.

1. Apache HBase. https://hbase.apache.org/

2. Apache Lucene. https://lucene.apache.org/

3. GenMyModel. https://www.genmymodel.com/

4. Massif: Matlab simulink integration framework for eclipse. https://github.com/viatra/massif

5. Alalfi, M.H., Cordy, J.R., Dean, T.R., Stephan, M., Stevenson, A.: Models are code too: Near-miss clone detection for simulink models. In: 2012 28th IEEE International Conference on Software Maintenance (ICSM), pp. 295–304. IEEE (2012)

Cited by 10 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Engineering recommender systems for modelling languages: concept, tool and evaluation;Empirical Software Engineering;2024-06-18

2. Advanced discovery mechanisms in model repositories;Software: Practice and Experience;2024-04-25

3. ModelSet: A labelled dataset of software models for machine learning;Science of Computer Programming;2024-01

4. EA ModelSet – A FAIR Dataset for Machine Learning in Enterprise Modeling;Lecture Notes in Business Information Processing;2023-11-25

5. TrackMine: Topic Tracking in Model Mining using Genetic Algorithm;2023 13th International Conference on Computer and Knowledge Engineering (ICCKE);2023-11-01

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3