{"id":26916,"date":"2019-12-27T18:19:46","date_gmt":"2019-12-27T12:19:46","guid":{"rendered":"https:\/\/www.enago.com\/academy\/?p=26916"},"modified":"2021-11-12T16:21:54","modified_gmt":"2021-11-12T10:21:54","slug":"how-to-improve-your-academic-writing-by-using-corpora","status":"publish","type":"post","link":"https:\/\/www.enago.com\/academy\/how-to-improve-your-academic-writing-by-using-corpora\/","title":{"rendered":"How to Improve Your Academic Writing Using Language Corpora"},"content":{"rendered":"<p>No matter how brilliant a researcher you are, you must be able to write about your research effectively to have any impact on the scientific world. Unfortunately for most of us, research and writing are two very different skills. Even the most talented researchers may struggle when it comes to writing clearly and concisely about their work. The burden is doubled for non-native English speakers. While English is widely accepted as the global language of science, it is also a tricky and difficult language to learn. What is the difference between \u201cput on\u201d and \u201cput off\u201d? Do you \u201ctake\u201d a sample or \u201cmake\u201d a sample? Where can you go when you need help with English writing? One little known, underused source of help for academic writing is language corpora. 
In this article, we will talk about how you can take advantage of this resource to improve your writing and increase your confidence with English.<\/p>\n<h2>What is a Language Corpus?<\/h2>\n<p>A language corpus is a collection of electronic text. Language corpora were originally created by researchers, usually linguists, for research purposes. 
Some popular corpora include the <u><a href=\"https:\/\/www.english-corpora.org\/coca\/\" target=\"_blank\" rel=\"noopener\">Corpus of Contemporary American English<\/a><\/u> (COCA), <u><a href=\"https:\/\/www.english-corpora.org\/coha\/\" target=\"_blank\" rel=\"noopener\">Corpus of Historical American English<\/a><\/u> (COHA), <u><a href=\"https:\/\/books.google.com\/ngrams\" target=\"_blank\" rel=\"noopener\">Google Books Ngram Viewer<\/a><\/u>, <u><a href=\"https:\/\/quod.lib.umich.edu\/cgi\/c\/corpus\/corpus\" target=\"_blank\" rel=\"noopener\" class=\"broken_link\">Michigan Corpus of Academic Spoken English<\/a><\/u>, <u><a href=\"https:\/\/hypcol.marutank.net\/\" target=\"_blank\" rel=\"noopener\">Hyper Collocation<\/a><\/u>, and more. These corpora offer a searchable collection of English used by native speakers in different contexts. In English language classes, they are often <u><a href=\"https:\/\/www.britishcouncil.org\/voices-magazine\/corpora-english-language-teaching\" target=\"_blank\" rel=\"noopener\" class=\"broken_link\">used as a tool by teachers<\/a><\/u> who want to show their students how a word is used in real life by native speakers.<\/p>\n<p>What is the difference between a corpus and a dictionary? Why would a non-native English speaker turn to a corpus instead of a dictionary for answers? First of all, while a dictionary can define a word for you, it often does not include many usage examples. The word \u201cextract\u201d means \u201cto remove or take out.\u201d But if I need to describe a physical action I took in my research, should I say \u201cextract to\u201d or \u201cextract from\u201d? A dictionary probably cannot answer that question, but a language corpus can.<\/p>\n<p>Familiarizing yourself with some simple corpus search functions will make a new range of tools available to you. Many corpora allow searches for synonyms and different word forms. 
For example, you could search for the verb form of \u201cextract\u201d using COCA and return \u201cextracts,\u201d \u201cextracting,\u201d \u201cextracted,\u201d and \u201cextract.\u201d You could also select \u201ccollocates\u201d for your search and return a list of words that are frequently found together with the word \u201cextract.\u201d Clicking the \u201chelp\u201d icon will show you a variety of search methods. For example, if you type in [=extract] you can find a list of synonyms for the word such as remove, separate, get, fetch, and so on.<\/p>\n<p>Another advantage of language corpora is that they are updated more frequently than dictionaries. A search in Webster\u2019s dictionary in early 2019 would not have returned a result for the term \u201cbioabsorbable.\u201d But the word was already in use, popularized by new advances in technology that were <u><a href=\"https:\/\/www.biospace.com\/article\/releases\/boston-scientific-announces-scheduled-presentations-at-transcatheter-cardiovascular-therapeutics-2019\/\" target=\"_blank\" rel=\"noopener\">presented in 2019<\/a><\/u>. The word <a href=\"https:\/\/www.msn.com\/en-us\/news\/us\/640-new-words-added-to-merriam-webster-dictionary\/ar-BBWevK6\" target=\"_blank\" rel=\"noopener\">was officially added<\/a> to the Merriam-Webster dictionary in the middle of 2019. If you were looking for examples of how to write using this word, corpora would be there to provide you with examples of contemporary use.<\/p>\n<h2>How Do I Use Language Corpora?<\/h2>\n<p>Learning to search on different language corpus tools can seem confusing at first. But don\u2019t worry: it gets easier quickly. Now let\u2019s look at how to choose a corpus and how to search for different words on these sites to get useful results.<\/p>\n<p>You should choose your language corpus depending on what your goal is. 
If you are looking for how to use a word that is not specific to your discipline, then COCA will be a great place to start. Let\u2019s say you want to know if you should say \u201cextract to\u201d or \u201cextract from.\u201d You can click on the link to COCA above and enter the term \u201cextract to\u201d in the search bar. Then you will click \u201cfind matching strings.\u201d<\/p>\n<p><img decoding=\"async\" class=\"aligncenter size-full wp-image-26961 lazyload\" data-src=\"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/COCA-1.jpg\" alt=\"\" width=\"1195\" height=\"642\" data-srcset=\"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/COCA-1.jpg 1195w, https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/COCA-1-428x230.jpg 428w, https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/COCA-1-768x413.jpg 768w, https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/COCA-1-750x403.jpg 750w\" data-sizes=\"(max-width: 1195px) 100vw, 1195px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 1195px; --smush-placeholder-aspect-ratio: 1195\/642;\" \/> When we perform this search for \u201cextract to,\u201d we return only 52 uses, while \u201cextract from\u201d returns 233.<\/p>\n<p><img decoding=\"async\" class=\"aligncenter size-full wp-image-26962 lazyload\" data-src=\"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/COCA-2.jpg\" alt=\"\" width=\"1197\" height=\"490\" data-srcset=\"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/COCA-2.jpg 1197w, https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/COCA-2-470x192.jpg 470w, https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/COCA-2-768x314.jpg 768w, https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/COCA-2-750x307.jpg 750w\" data-sizes=\"(max-width: 1197px) 100vw, 1197px\" 
src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 1197px; --smush-placeholder-aspect-ratio: 1197\/490;\" \/> We can click on \u201ccontext\u201d to see exactly how it is used. Based on this search, we can conclude that \u201cextract from\u201d is the more common phrasing and the safer one to use.<\/p>\n<p><img decoding=\"async\" class=\"aligncenter size-full wp-image-26963 lazyload\" data-src=\"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/COCA-3.jpg\" alt=\"\" width=\"1204\" height=\"955\" data-srcset=\"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/COCA-3.jpg 1204w, https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/COCA-3-290x230.jpg 290w, https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/COCA-3-768x609.jpg 768w, https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/COCA-3-605x480.jpg 605w, https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/COCA-3-279x220.jpg 279w\" data-sizes=\"(max-width: 1204px) 100vw, 1204px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 1204px; --smush-placeholder-aspect-ratio: 1204\/955;\" \/> For more discipline-specific words, you can try the Michigan Corpus of Academic Spoken English (<u><a href=\"https:\/\/quod.lib.umich.edu\/cgi\/c\/corpus\/corpus?c=micase;page=simple\" target=\"_blank\" rel=\"noopener\" class=\"broken_link\">MICASE corpus<\/a><\/u>), which offers some limited examples. 
The advantage of Michigan\u2019s tool is that <u><a href=\"https:\/\/teachingcommons.stanford.edu\/teachingwriting\/pwr-guide\/teaching-multilingual-students\/academic-language-tools\" target=\"_blank\" rel=\"noopener\" class=\"broken_link\">you can search by discipline or type of academic event.<\/a><\/u> If you are writing to prepare for a specific type of event or branching into a new part of your field, this tool can be particularly helpful for you.<\/p>\n<p>You may also be wondering about the differences between American and British English. Don\u2019t worry: there are corpora to help you with those searches too. The <u><a href=\"https:\/\/www.english-corpora.org\/\" target=\"_blank\" rel=\"noopener\">BYU Corpus site<\/a><\/u> has links to British English and American English corpora, and you can search and compare to see what terms or phrases are used in one style over the other. Should we say \u201cin hospital\u201d or \u201cin the hospital\u201d? A search of the corpora shows that Americans favor \u201cin the hospital,\u201d while British English speakers simply say \u201cin hospital.\u201d<\/p>\n<h2>A Few Notes of Caution<\/h2>\n<p>You may be very excited to begin using this new tool. You should be! Language corpora can be extraordinarily helpful in providing you with real-world examples of language that you would have difficulty finding otherwise. Dictionaries and Google searches do not provide nearly the amount of detail and context that corpora do. However, there are still some <u><a href=\"http:\/\/arrantpedantry.com\/wp-content\/uploads\/2016\/04\/Copyediting-and-Corpus-Linguistics.pdf\" target=\"_blank\" rel=\"noopener\">points of caution to keep in mind<\/a><\/u> when relying on corpora to improve your writing. First, corpora don\u2019t tell you what is correct and incorrect. They simply tell you what usage is common. 
You can use corpora to improve your writing, but you may need to dig deeper and compare your data from corpora with other sources.<\/p>\n<p>That said, language is a funny thing. The key thing to remember is that language is about communication. When you seek out how to use certain words, real-world examples can give you a new and deeper level of understanding of the words themselves. For that reason, language corpora are a great tool to have in your toolbox when it comes to improving your academic writing.<\/p>\n<p>Do you use language corpora to help you in academic writing? Which corpus do you find most helpful? What are some other good resources for ESL writers to improve their academic writing? Let us know in the comments below!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>No matter how brilliant a researcher you are, you must be able to write 
about&hellip;<\/p>\n","protected":false},"author":4,"featured_media":26964,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"om_disable_all_campaigns":false,"footnotes":""},"categories":[7,2],"tags":[1426,1429],"ppma_author":[1895],"class_list":["post-26916","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-language-grammar","category-academic-writing","tag-good-word-choice","tag-sentence-formation"],"better_featured_image":{"id":26964,"alt_text":"Corpora","caption":"","description":"ESL researchers can often face difficulties when constructing sentences in English for their research papers. Language Corpora help non-native researchers to understand the correct usage of words in the English language thus helping them in drafting better manuscripts. ","media_type":"image","media_details":{"width":750,"height":430,"file":"2019\/12\/LanguageCorpora.jpg","sizes":{"thumbnail":{"file":"LanguageCorpora-170x150.jpg","width":170,"height":150,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/LanguageCorpora-170x150.jpg"},"medium":{"file":"LanguageCorpora-401x230.jpg","width":401,"height":230,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/LanguageCorpora-401x230.jpg"},"large":{"file":"LanguageCorpora-750x430.jpg","width":750,"height":430,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/LanguageCorpora-750x430.jpg"},"publisher-tb1":{"file":"LanguageCorpora-86x64.jpg","width":86,"height":64,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/LanguageCorpora-86x64.jpg"},"publisher-sm":{"file":"LanguageCorpora-210x136.jpg","width":210,"height":136,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019
\/12\/LanguageCorpora-210x136.jpg"},"publisher-mg2":{"file":"LanguageCorpora-279x220.jpg","width":279,"height":220,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/LanguageCorpora-279x220.jpg"},"publisher-md":{"file":"LanguageCorpora-357x210.jpg","width":357,"height":210,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/LanguageCorpora-357x210.jpg"},"publisher-tall-sm":{"file":"LanguageCorpora-180x217.jpg","width":180,"height":217,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/LanguageCorpora-180x217.jpg"},"publisher-tall-lg":{"file":"LanguageCorpora-267x322.jpg","width":267,"height":322,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/LanguageCorpora-267x322.jpg"},"publisher-tall-big":{"file":"LanguageCorpora-368x430.jpg","width":368,"height":430,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/LanguageCorpora-368x430.jpg"},"Book 
Review":{"file":"LanguageCorpora-320x430.jpg","width":320,"height":430,"mime-type":"image\/jpeg","source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/LanguageCorpora-320x430.jpg"}},"image_meta":{"aperture":"0","credit":"","camera":"","caption":"","created_timestamp":"0","copyright":"","focal_length":"0","iso":"0","shutter_speed":"0","title":"","orientation":"0","keywords":[]}},"post":26916,"source_url":"https:\/\/www.enago.com\/academy\/wp-content\/uploads\/2019\/12\/LanguageCorpora.jpg"},"acf":{"faq_main_heading":"","faq_heading_one":"","faq_heading_two":"","faq_heading_three":"","faq_heading_four":"","faq_heading_five":"","faq_heading_six":"","faq_description_one":"","faq_description_two":"","faq_description_three":"","faq_description_four":"","faq_description_five":"","faq_description_six":""},"views":1310,"single_webinar_page_date":null,"single_webinar_page_time":null,"session_agenda":null,"who_should_attend_this_session":null,"about_the_speaker_field":null,"co-webinar-sec":null,"co_webinar_sec_one":null,"speaker-name":null,"webinar-date":null,"webinar-time":null,"webinar-s-image":null,"custum_webinar_category":null,"authors":[{"term_id":1895,"user_id":4,"is_guest":0,"slug":"editor","display_name":"Enago Academy","avatar_url":"https:\/\/secure.gravatar.com\/avatar\/2ef4bc47f3ceaa56f5eb3b26f9520fad298ba36ede4f86315997ffb45db37a1f?s=96&d=identicon&r=g","author_category":"","user_url":"","last_name":"Academy","first_name":"Editor","job_title":"","description":"Enago Academy, the knowledge arm of Enago, offers comprehensive and up-to-date resources on academic research and scholarly publishing to all levels of scholarly professionals: students, researchers, editors, publishers, and academic societies. It is also a popular platform for networking, allowing researchers to learn, share, and discuss their experiences within their network and community. 
The team, which comprises subject matter experts, academicians, trainers, and technical project managers, are passionate about helping researchers at all levels establish a successful career, both within and outside academia."}],"_links":{"self":[{"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/posts\/26916","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/comments?post=26916"}],"version-history":[{"count":0,"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/posts\/26916\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/media\/26964"}],"wp:attachment":[{"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/media?parent=26916"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/categories?post=26916"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/tags?post=26916"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.enago.com\/academy\/wp-json\/wp\/v2\/ppma_author?post=26916"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}