
How can I extract phrases of different POS combinations from an English corpus?

(2022-03-15 13:03:35)
Tags: corpus, English

Category: Corpora-tools
     This question came from a colleague of my supervisor. In his message, my supervisor explained that the colleague had asked how phrases of a particular type, e.g. N+N, can be extracted from an English corpus. The same work had already been done on a Chinese corpus, for a comparative study of English and Chinese.
      Here is my response. A) If you have built and POS-tagged a corpus yourself, AntConc can help with the search. B) If you do not have a corpus on your computer, you can search an online corpus instead. The corpora I suggested are a) the Corpus of Contemporary American English (URL: https://www.english-corpora.org/coca/) and b) the British National Corpus (URL: https://www.english-corpora.org/bnc/). Both belong to the series of online corpora provided by Mark Davies. A tutorial on the search syntax is available.
        The following screenshot gives a rough idea of what the work looks like.
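      For option A, the idea behind searching a tagged corpus can be sketched in a few lines of Python. This is only an illustration, not what AntConc does internally. It assumes the corpus is plain text in which every token is written as word_TAG with Penn Treebank style tags (NN, NNS, JJ and so on); that format, the sample sentence, and the function names parse_tagged and extract_pos_sequences are all assumptions made for the example.

```python
def parse_tagged(text, sep="_"):
    """Split whitespace-separated 'word_TAG' tokens into (word, tag) pairs."""
    pairs = []
    for token in text.split():
        # rpartition splits on the LAST separator, so words containing "_" survive.
        word, found, tag = token.rpartition(sep)
        if found and word:
            pairs.append((word, tag))
    return pairs


def extract_pos_sequences(pairs, tag_prefixes):
    """Return every run of adjacent words whose tags start with the given prefixes.

    ["NN", "NN"] matches N+N; note that the prefix "NN" also covers
    NNS, NNP and NNPS (plural and proper nouns).
    """
    n = len(tag_prefixes)
    phrases = []
    for i in range(len(pairs) - n + 1):
        window = pairs[i:i + n]
        if all(tag.startswith(prefix)
               for (_, tag), prefix in zip(window, tag_prefixes)):
            phrases.append(" ".join(word for word, _ in window))
    return phrases


# Hypothetical tagged sentence, used only to demonstrate the functions.
sample = "The_DT computer_NN system_NN stores_VBZ customer_NN data_NNS ._."
noun_noun = extract_pos_sequences(parse_tagged(sample), ["NN", "NN"])
print(noun_noun)  # ['computer system', 'customer data']
```

      Other combinations only need a different prefix list, e.g. ["JJ", "NN"] for Adj+N. If the corpus was tagged with a different tagset or a different separator, the prefixes and the sep argument have to be adjusted accordingly.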
[Screenshot: search syntax]
        The screenshot above shows the search syntax for phrases of the pattern N+N+system, that is, two nouns followed by the word "system".

[Screenshot: phrase list]

      The screenshot above shows the phrase list returned by the search in the Corpus of Contemporary American English.
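      A comparable phrase list for the N+N+system pattern could also be produced from a locally tagged corpus. The sketch below uses a regular expression and a frequency counter. It is not the query language of the online corpora, and it again assumes a hypothetical word_TAG text format with Penn Treebank style noun tags; the sample text and the function name noun_noun_system_counts are invented for illustration.

```python
import re
from collections import Counter

# Two noun-tagged tokens followed by the literal word "system" (also noun-tagged).
# (?<!\S) makes sure a match can only start at the beginning of a token.
PATTERN = re.compile(r"(?<!\S)(\S+)_NN\S*\s+(\S+)_NN\S*\s+(system)_NN\S*")


def noun_noun_system_counts(tagged_text):
    """Count N+N+system phrases, ignoring case differences in the words."""
    counts = Counter()
    # findall returns non-overlapping matches as (noun1, noun2, "system") tuples.
    for first, second, head in PATTERN.findall(tagged_text):
        counts[f"{first} {second} {head}".lower()] += 1
    return counts


# Hypothetical tagged text, used only to demonstrate the function.
sample = (
    "The_DT health_NN care_NN system_NN is_VBZ complex_JJ ._. "
    "A_DT new_JJ operating_NN system_NN arrived_VBD ._. "
    "Critics_NNS blame_VBP the_DT Health_NN care_NN system_NN ._."
)
phrase_counts = noun_noun_system_counts(sample)
print(phrase_counts.most_common())  # [('health care system', 2)]
```

      "new operating system" is not counted because "new" is tagged as an adjective, so only one noun precedes "system". Sorting with most_common() gives a list ordered by frequency, similar in spirit to the list an online corpus returns.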


