【转】免费的英语语料库汇总
(2012-07-08 22:08:55)
标签:
杂谈 |
分类: 本地化翻译 |
免费的英语语料库汇总
--------------------------------------------------------------------------------
Some are not corpora, but (I think) they are corpus-related. The list is incomplete and just let me know if I omit any corpora.
1. The best corpora
COCA:
BNC-BYU:
TIME-BYU:http://corpus.byu.edu/time/
JustTheWord:http://193.133.140.102/JustTheWord/index.html
BNCweb:http://bncweb.lancs.ac.uk/bncwebSignup/user/login.php
Jukuu(句酷):
Leeds:
Lextutor:
Web Concordancer:
2. General Corpora
Jiaoda(上海交大):
Brown/lob Corpus:
Corpuseye:
Corpus swb :
BNC:
Bank of English:
ANC:
ICE Corpora
3. English-Chinese Parellel Corpora(英汉双语语料库)
CEO:http://www.fleric.org.cn/ceo/
Babel:http://score.crpp.nie.edu.sg/cgi-bin/babel/paraconc.pl
The Dream Of Red Chamber(红楼梦):
HK
Poly U(香港理工大学):
Laozi(老子):
Xiamen U(厦门大学):
4. Textbook Corpora
College English:http://www.corpus4u.org/corpora/COLEN.rar
New Horizon College English(NHCE):http://www.nhce.edu.cn
New Concept English:http://luwei.2288.org/oechw/hanyu/da...e/framconc.asp
Family Album USA:
5. Business and Financial Corpora
Business English Corpus (BEC):
PolyU Business Corpus:
Business Letter Corpus:
Financial Corpus:
6. Literary Corpora
The Online Corpus of Old English Poetry
(OCOEP):
Shakespeare's Sonnets Corpus:
Blues Lyric Poetry Corpus:
Canadian Poets Anthology Corpus:
CAPA (contemporary American Poetry
Archive):
Claremont Corpus of Elizabethan Verse:
Late Modern English Prose Corpus:
New Dragon Book of Verse Corpus :
Northwest Coast Indian mythology Corpus:
Online Classics Horror and Phantasy
Fiction:
SETIS Australian Literary and Historical
Texts:
Corpus of Middle English Prose and Verse:
HarryPotter Corpus:
Towneley Plays Corpus:
Web Concordances Site:
York Miracle Play Cycle Corpus:
ME Texts Anthology Corpus:
7. Web As Corpus
Web As Corpus :http://webascorpus.org/searchwac.htm
Web Corp:
WebCONC:
8. Learner Corpora
Chinese Learners of English(中国英语学习者):
Corpus of Hungarian students' essays:
http://joeandco.blogspot.com/2008/06...subcorpus.html
The Multimedia Adult English Learner Corpus:http://www.labschool.pdx.edu/maelc_access.html
The Uppsala Student English Corpus (USE):
Dowloadable data at
Michigan Corpus of Upper-level Student
Papers:
IWILL Corpus:
Wordneighbours:
PICLE Corpus:http://ifa.amu.edu.pl/~kprzemek/conc...h_adv_new.html
EVA Corpus:
PolyU Language Bank Concordancer:
http://langbank.engl.polyu.edu.hk/en...ng=1&corpus=16
The Montclair Electronic Language Learners' Database under construction)
http://www.chss.montclair.edu/linguistics/MELD/
Singapore Corpus of Research in Education:http://score.crpp.nie.edu.sg/score/index.htm
Birkbeck Spelling Error Corpus:
Open Mind Commonsense Corpus:
Corpus for Higher Education:
http://langbank.engl.polyu.edu.hk/en...ng=1&corpus=11
National Taiwan Normal University Corpora:
http://llrc.eng.ntnu.edu.tw/English/search/Default.htm
http://llrc.eng.ntnu.edu.tw/English/search/tag.htm
http://llrc.eng.ntnu.edu.tw/English/search/tag2.htm
ELISA corpus:
9. News Corpora
Reuters Corpus:
arpers Magazine 1879-1880 Corpus:
Hong Kong South China Morning Post
Corpus:
New York Newspaper Advertisements and News Items
1777-1779:
VOA Special English Corpus:
VOA Special English audio and text
corpus:
American News Stories Corpus:
MPQA Opinion Corpus: