Active Learning: Encoder-Decoder-Outlayer and Vector Space Diversification Sampling-Reference-Cited by-同舟云学术

Active Learning: Encoder-Decoder-Outlayer and Vector Space Diversification Sampling

Published:2023-06-23 Issue:13 Volume:11 Page:2819
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Zeng Hongyi¹,Kong Fanyi²

Affiliation:

1. Department of Computer Science, University of Toronto, Toronto, ON M5S 0A5, Canada

2. Department of Mechanical & Industrial Engineering, Northeastern University, Boston, MA 02115, USA

Abstract

This study introduces a training pipeline comprising two components: the Encoder-Decoder-Outlayer framework and the Vector Space Diversification Sampling method. This framework efficiently separates the pre-training and fine-tuning stages, while the sampling method employs pivot nodes to divide the subvector space and selectively choose unlabeled data, thereby reducing the reliance on human labeling. The pipeline offers numerous advantages, including rapid training, parallelization, buffer capability, flexibility, low GPU memory usage, and a sample method with nearly linear time complexity. Experimental results demonstrate that models trained with the proposed sampling algorithm generally outperform those trained with random sampling on small datasets. These characteristics make it a highly efficient and effective training approach for machine learning models. Further details can be found in the project repository on GitHub.

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/11/13/2819/pdf

Reference24 articles.

1. Bražinskas, A., Nallapati, R., Bansal, M., and Dreyer, M. (2022). Efficient few-shot fine-tuning for opinion summarization. arXiv.

2. Dodge, J., Ilharco, G., Schwartz, R., Farhadi, A., Hajishirzi, H., and Smith, N. (2020). Fine-tuning pretrained language models: Weight initializations, data orders, and early stopping. arXiv.

3. Hu, E.J., Shen, Y., Wallis, P., Allen-Zhu, Z., Li, Y., Wang, S., Wang, L., and Chen, W. (2021). Lora: Low-rank adaptation of large language models. arXiv.

4. Settles, B. (2023, April 01). Active Learning Literature Survey: Semantic Scholar, Active Learning Literature Survey|Semantic Scholar. Available online: https://www.semanticscholar.org/paper/Active-Learning-Literature-Survey-Settles/818826f356444f3daa3447755bf63f171f39ec47.

5. Dai, Y., Yang, C., Liu, Y., and Yao, Y. (2023). Latent-Enhanced Variational Adversarial Active Learning Assisted Soft Sensor. IEEE Sens. J.