Tools & Utilities
Behind every rigorous discourse analysis is a clean, carefully managed dataset. The Tools section in DATS provides a suite of specialized utilities designed to help you prepare, clean, and monitor your corpus before and during your deep analysis.
Located under the toolbox icon (🧰) in the main left navigation bar, these tools handle the logistical challenges often associated with large-scale qualitative research:
- Document Sampler: Automate the creation of statistically sound, representative subsets from massive corpora for manual annotation.
- Duplicate Finder: Keep your quantitative results accurate by easily identifying and removing redundant files or repeatedly scraped web pages.
- Health View: Monitor the exact status of the automated machine-learning preprocessing pipeline to ensure every document is fully extracted, transcribed, and indexed.