1. Document sublanguage clustering to detect medical specialty in cross-institutional clinical texts;Doing-Harris;Proceedings of the ACM International Workshop on Data and Text Mining in Biomedical Informatics,2013
2. Document clustering of clinical narratives: a systematic study of clinical sublanguages;Patterson;AMIA Annu Symp Proc,2011
3. Longitudinal analysis of new information types in clinical notes;Zhang;AMIA Jt Summits Transl Sci Proc,2014
4. Redundancy in electronic health record corpora: analysis, impact on text mining performance and mitigation strategies;Cohen;BMC Bioinf.,2013
5. Analysis of a probabilistic model of redundancy in unsupervised information extraction;Artif. Intell.,2010