|
Parts of Speech
|
Parts of Speech searchesEvery word in the CCS has its part of speech (POS) marked, or tagged. You can imagine that tagging every word in a multi-million word corpus is a daunting job. In fact, it is done by computers with an estimated accuracy rate of 95%. Being able to search for a word by POS is often essential. For example, the word form ROSE can be a flower or the past tense of rise. Mixed search results will not be helpful. Some POS issues have been deliberately avoided in CCS, such as the use of some participles as adjectives: winning is not marked as an adjective in “winning smile” and failed is not marked as an adjective in “a failed bank”, but homing is in “homing device”. Also above and fast cannot be located using the adverb tag. Nevertheless, it still provides a view on language that I couldn't imagine finding out about in any other way (apologies to Jan Svartvik).
The query syntax is: the word, a slash and the tag in CAPITAL LETTERS. Here is list CCS POS tags. Other corpora have different tags, and other concordancing programs have different ways of forming a query. Table of POS tagsThis is an expansion of the information provided on the Cobuild site below the concordance entry box.
Note: these POS tags become much more powerful when used in combination as we see in Session 5.
Refining your searchesYou can now refine the searches you tried in the previous session. Lemma: Try a search on peer, which is a proper noun, and has two meanings as a common noun. It is also a verb. Try peer@. Try peer@/NOUN and peer@/VERB Word Family: Try preten*/JJ. and see adjectives starting with “preten”. Try prohib*/RB and you will see the derived adverb. Try contra*/NNS and you will see quite a few nouns in the plural that start this way. What nouns are in the contract family? Try contract*/NOUN What adjectives derive from oil? Search oil*/JJ. What adjectives derive from club? club*/JJ
|