1. Fainder: A Fast and Accurate Index for Distribution-Aware Dataset Search;Proceedings of the VLDB Endowment;2024-07
2. SET: Searching Effective Supervised Learning Augmentations in Large Tabular Data Repositories;Proceedings of the Conference on Governance, Understanding and Integration of Data for Effective and Responsible AI;2024-06-09
3. Efficient Approximate Maximum Inner Product Search Over Sparse Vectors;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13
4. Efficiently Estimating Mutual Information Between Attributes Across Tables;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13
5. Chorus: Foundation Models for Unified Data Discovery and Exploration;Proceedings of the VLDB Endowment;2024-04