linguistic corpus
C15286
concept
A linguistic corpus is a large, structured collection of authentic texts or transcribed speech used for analyzing language patterns, usage, and structure.
Aliases (2)
- language model training dataset ×1
- text corpus ×1
Instances (4)
- CORPES XXI corpus
- CREA corpus
- WebText dataset ("text corpus")