An evaluation dataset for intent classification and out-of-scope prediction S Larson, A Mahendran, JJ Peper, C Clarke, A Lee, P Hill, K Leach, ... arXiv preprint arXiv, 2019 | 514 | 2019 |
Outlier Detection for Improved Data Quality and Diversity in Dialog Systems S Larson, A Mahendran, A Lee, JK Kummerfeld, P Hill, MA Laurenzano, ... NAACL, 2019 | 48 | 2019 |
LSOIE: A Large-Scale Dataset for Supervised Open Information Extraction J Solawetz, S Larson EACL, 2021 | 30 | 2021 |
A Survey of Intent Classification and Slot-Filling Datasets for Task-Oriented Dialog S Larson, K Leach arXiv preprint arXiv:2207.13211, 2022 | 20 | 2022 |
Evaluating out-of distribution performance on document image classifiers S Larson, G Lim, Y Ai, D Kuang, K Leach Thirty-sixth Conference on Neural Information Processing Systems Datasets …, 2022 | 19* | 2022 |
Iterative Feature Mining for Constraint-Based Data Collection to Increase Data Diversity and Model Robustness S Larson, A Zheng, A Mahendran, R Tekriwal, A Cheung, E Guldan, ... Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020 | 18 | 2020 |
Inconsistencies in Crowdsourced Slot-Filling Annotations: A Typology and Identification Methods S Larson, A Cheung, A Mahendran, K Leach, JK Kummerfeld Proceedings of the 28th International Conference on Computational …, 2020 | 15 | 2020 |
Augraphy: A data augmentation library for document images A Groleau, KW Chee, S Larson, S Maini, J Boarman International Conference on Document Analysis and Recognition, 384-401, 2023 | 11 | 2023 |
Redwood: Using Collision Detection to Grow a Large-Scale Intent Classification Dataset S Larson, K Leach arXiv preprint arXiv:2204.05483, 2022 | 10 | 2022 |
On Evaluation of Document Classification using RVL-CDIP S Larson, G Lim, K Leach Proceedings of the 17th Conference of the European Chapter of the …, 2023 | 6* | 2023 |
Systems and methods for automatically configuring training data for training machine learning models of a machine learning-based dialogue system including seeding training … S Larson, A Mahendran, A Lee, JK Kummerfeld, P Hill, MA Laurenzano, ... US Patent 10,679,150, 2020 | 6 | 2020 |
Systems and methods implementing data query language and utterance corpus implements for handling slot-filling and dialogue intent classification data in a machine learning … S Larson, K Leach, MA Laurenzano US Patent 11,183,175, 2021 | 5 | 2021 |
Systems and methods for mixed setting training for slot filling machine learning tasks in a machine learning task-oriented dialogue system DC Michelin, JK Kummerfeld, K Leach, S Larson, JJ Peper, Y ZHANG US Patent 11,043,208, 2021 | 5 | 2021 |
Exploring Out-of-Distribution Generalization in Text Classifiers Trained on Tobacco-3482 and RVL-CDIP S Larson, N Singh, S Maheshwari, S Stewart, U Krishnaswamy Document Analysis and Recognition–ICDAR 2021 Workshops: Lausanne …, 2021 | 4 | 2021 |
Systems and methods for constructing an artificially diverse corpus of training data samples for training a contextually-biased model for a machine learning-based dialogue system A Lee, S Larson, C Clarke, K Leach, JK Kummerfeld, P Hill, J Hauswald, ... US Patent 10,796,104, 2020 | 4 | 2020 |
Data Query Language and Corpus Tools for Slot-Filling and Intent Classification Data S Larson, E Guldan, K Leach Proceedings of The 12th Language Resources and Evaluation Conference, 7060-7068, 2020 | 4 | 2020 |
ShabbyPages: A Reproducible Document Denoising and Binarization Dataset A Groleau, KW Chee, S Larson, S Maini, J Boarman arXiv preprint arXiv:2303.09339, 2023 | 1 | 2023 |
Systems and methods for automatically detecting and repairing slot errors in machine learning training data for a machine learning-based dialogue system S Larson, A Mahendran, P Hill, JK Kummerfeld, MA Laurenzano, L Tang, ... US Patent 10,929,761, 2021 | 1 | 2021 |
Generating Hard-Negative Out-of-Scope Data with ChatGPT for Intent Classification Z Li, S Larson, K Leach arXiv preprint arXiv:2403.05640, 2024 | | 2024 |
On Evaluation of Document Classification using RVL-CDIP S Larson, G Lim, K Leach arXiv preprint arXiv:2306.12550, 2023 | | 2023 |