This question comes from a colleague of my supervisor. In a message
to me, my supervisor said that a colleague of his had asked how a
certain type of phrase, e.g. N+N, can be extracted from an English
corpus, since the same work has already been done with a Chinese
corpus for a comparative study of English and Chinese.
Here is my response. A) If you have built and POS-tagged a corpus
yourself, AntConc can help you with the searching. B) If you do not
have a corpus on your computer, you can do the work by searching
online corpora. The corpora I suggested are a) the Corpus of
Contemporary American English (URL: https://www.english-corpora.org/coca/)
and b) the British National Corpus (URL: https://www.english-corpora.org/bnc/).
Both belong to the series of online corpus services provided by Mark
Davies. A tutorial on the search syntax is available.
The following screenshot gives a rough idea of what the work looks
like.
The above shows the search syntax for phrases of the form
N+N+system, i.e. two nouns followed by the word "system".
The above is a screenshot of the phrase list returned by searching
the Corpus of Contemporary American English.
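For option A, the same kind of list can also be produced with a few
lines of script instead of a concordancer. The following is only a
minimal sketch, not what AntConc or the online corpora do internally.
It assumes the corpus is plain text in word_TAG format with Penn
Treebank noun tags (NN, NNS, NNP, NNPS); the function names and the
sample sentence are made up for illustration.

```python
from collections import Counter

# Assumption: Penn Treebank noun tags; adjust for your own tagset.
NOUN_TAGS = {"NN", "NNS", "NNP", "NNPS"}


def parse_tagged(text):
    """Turn 'word_TAG word_TAG ...' into a list of (word, tag) pairs."""
    tokens = []
    for item in text.split():
        word, sep, tag = item.rpartition("_")
        if sep:  # skip items that carry no tag
            tokens.append((word.lower(), tag))
    return tokens


def find_phrases(tokens, trailing_word=None):
    """Count N+N phrases, or N+N+<trailing_word> if a word is given."""
    size = 3 if trailing_word else 2
    counts = Counter()
    for i in range(len(tokens) - size + 1):
        window = tokens[i:i + size]
        # The first two slots must both be nouns.
        if not all(tag in NOUN_TAGS for _, tag in window[:2]):
            continue
        # The optional third slot must match the given word exactly.
        if trailing_word and window[2][0] != trailing_word:
            continue
        counts[" ".join(word for word, _ in window)] += 1
    return counts


# Hypothetical sample sentence, tagged by hand for this illustration.
sample = ("The_DT computer_NN science_NN department_NN bought_VBD "
          "a_DT water_NN filtration_NN system_NN ._.")

tokens = parse_tagged(sample)
print(find_phrases(tokens).most_common())            # all N+N phrases
print(find_phrases(tokens, "system").most_common())  # N+N+system only
```

Note that this sketch slides over the whole text, so a match can span
a sentence boundary; for real data it is safer to process one
sentence at a time.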