Skip to content

Tools & Utilities

Behind every rigorous discourse analysis is a clean, carefully managed dataset. The Tools section in DATS provides a suite of specialized utilities designed to help you prepare, clean, and monitor your corpus before and during your deep analysis.

Located under the toolbox icon (🧰) in the main left navigation bar, these tools handle the logistical challenges often associated with large-scale qualitative research:

  • Document Sampler: Automate the creation of statistically sound, representative subsets from massive corpora for manual annotation.
  • Duplicate Finder: Keep your quantitative results accurate by easily identifying and removing redundant files or repeatedly scraped web pages.
  • Health View: Monitor the exact status of the automated machine-learning preprocessing pipeline to ensure every document is fully extracted, transcribed, and indexed.