Author:
Yokoyama Yuya,Hochin Teruhisa,Nomiya Hiroki
Abstract
AbstractWith a view to solving the mismatches between the ideas of questioners and respondents of Question and Answer (Q&A) sites, impression evaluation experiments have resulted in obtaining nine factors of impressions. Then through multiple regression analysis factor scores have been estimated by utilizing the feature values of statements, such as syntactic information, etc. Those factor scores calculated were subsequently employed for inspecting their potential to detect respondents who are expected and likely to appropriately answer a newly posted question. Nevertheless, our method so far has largely depended on the syntactic information extracted through morphological analysis. Moreover, the number of explanatory variables utilized for obtaining factor scores has been appreciably extravagant and complex. Thus, instead of morphological analysis, 2-gram was applied to the explanatory variables to estimate factor scores. The analysis result with the application of 2-gram has led to greater estimation accuracy than the case of morphological analysis for all nine factors. For further perception and comparison, in this paper, 3-gram was applied to the feature values in place of 2-gram or morphological analysis, in a similar fashion as the previous analysis using 2-gram. Further analysis has shown that 2-gram and 3-gram outperform morphological analysis in terms of estimation accuracy. Comparing the results for the nine factors, 2-gram showed the best results. It could also be suggested that a mere 2-gram or 3-gram would be sufficient in applying N-gram as syntactic information of the feature values to estimate factor scores.
Funder
Japan Society for the Promotion of Science
Publisher
Springer Science and Business Media LLC
Subject
Computer Networks and Communications,Computer Science Applications
Reference17 articles.
1. Yahoo! Chiebukuro (URL, in Japanese), http://chiebukuro.yahoo.co.jp/, 2021–12–16
2. Blooma MJ, Chua AYK and Goh DHL (2008) A predictive framework for retrieving the best answer. In the Proceedings of 2008 ACM Symposium on Applied Computing (SAC08), pp, 1107–1111. https://doi.org/10.1145/1363686.1363944
3. Calefato F, Lanubile F, Novielli N (2019) An empirical assessment of best-answer prediction models in technical Q&A sites. Empir Softw Eng 24:854–901. https://doi.org/10.1007/s10664-018-9642-5
4. Zhang Z, Lu Y, Wilson C and He Z (2019) Making sense of clinical laboratory results: an analysis of questions and replies in a social Q&A community. In the Proceedings of the 17th World Congress on Medical and Health Informatics (MEDINFO 2019), pp 2009–2010. https://doi.org/10.3233/SHTI190759
5. Haq EU, Braud T and Hui P (2020) Community matters more than anonymity: analysis of user interactions on the Quora Q&A platform. In the Proceedings of the International conference series on Advances in Social Network Analysis and Mining (ASONAM 2020), pp 94–98
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Impression and Suitability of Q&A Statements through Factor Scores Using 2-Gram;2023 15th International Congress on Advanced Applied Informatics Winter (IIAI-AAI-Winter);2023-12-11
2. Application of 3-gram to English Statements Posted at Q&A Sites to Obtain Factor Scores;Communications in Computer and Information Science;2023-10-31
3. Application of 5-gram to Obtain Factor Scores of Japanese Q&A Statements;2023 14th IIAI International Congress on Advanced Applied Informatics (IIAI-AAI);2023-07-08
4. Consideration of Semantics between Q&A Statements to Obtain Factor Score;2023 IEEE/ACIS 21st International Conference on Software Engineering Research, Management and Applications (SERA);2023-05-23
5. Using 4-gram to Obtain Factor Scores of Japanese Statements Posted at Q&A Sites;2022 13th International Congress on Advanced Applied Informatics Winter (IIAI-AAI-Winter);2022-12