Małgorzata Warzecha
What is a corpus?
Corpus:
From the Latin for body (plural corpora), a
corpus is a body of language representative of a
particular variety of language or genre which is
collected and stored in electronic form for
analysis using concordance software.
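To make "analysis using concordance software" concrete, the following is a minimal sketch of a keyword-in-context (KWIC) display, the basic view a concordancer produces. The function name `kwic`, the `width` parameter, and the three-sentence sample text are illustrative assumptions, not part of any particular tool; a real corpus would be far larger and loaded from files.

```python
import re


def kwic(text, keyword, width=30):
    """Return one keyword-in-context line per occurrence of `keyword`.

    `width` is the number of characters of context shown on each side
    (a hypothetical default chosen for readability).
    """
    lines = []
    # Match the keyword as a whole word, ignoring case.
    pattern = re.compile(r"\b" + re.escape(keyword) + r"\b", re.IGNORECASE)
    for match in pattern.finditer(text):
        left = text[max(0, match.start() - width):match.start()]
        right = text[match.end():match.end() + width]
        # Right-align the left context so the keywords line up in a column.
        lines.append(f"{left:>{width}} [{match.group()}] {right:<{width}}")
    return lines


# Hypothetical sample text standing in for a stored corpus.
sample = (
    "She will perform a task today. They perform a play every year. "
    "He cannot perform the task alone."
)

for line in kwic(sample, "perform"):
    print(line)
```

Aligning every hit in one column is what lets an analyst scan the words immediately to the left and right of the keyword and spot recurring patterns.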
Types of corpora:
1. Specialised corpus
2. General corpus
3. Multilingual corpora
4. Parallel corpus
5. Learner corpus
6. Historical or Diachronic corpus
7. Monitor corpus
but:
2. Criticism
3. Example of annotation
2. Criticism by Chomsky
Any natural corpus will be skewed. Some
sentences won't occur because they are obvious,
others because they are false, still others because
they are impolite. The corpus, if natural, will be so
wildly skewed that the description would be no
more than a mere list.
(Chomsky, University of Texas, 1962)
Chomsky: The verb perform cannot be used
with mass word objects: one can perform a task
but one cannot perform labour.
Hatcher: How do you know, if you don't use a
corpus and have not studied the verb perform?
Chomsky: How do I know? Because I am a
native speaker of the English language.
(Hill, 1962)
Sources
Websites:
https://www.futurelearn.com/courses/corpus-linguistics
http://www.lancaster.ac.uk/
http://www.antlab.sci.waseda.ac.jp/software.html
Literature: