A corpus may contain texts in a single language (monolingual corpus) or text data in multiple languages (multilingual corpus). Multilingual corpora that have been specially formatted for side-by-side comparison are called aligned parallel corpora.
In order to make the corpora more useful for doing linguistic research, they are often subjected to a process known as annotation.
沒有留言:
張貼留言