AUTOMATIC RETRIEVAL AND THE FORMALIZATION OF MULTI WORDS EXPRESSIONS WITH F-WORDS IN THE CORPUS OF CONTEMPORARY AMERICAN ENGLISH

https://doi.org/10.22146/jh.8709

Prihantoro Prihantoro(1*)

(1) Faculty of Humanities Diponegoro University Semarang, Indonesia
(*) Corresponding Author

Abstract


The research problems in this research are 1) how lexicogrammar takes role in determining polarity of F-Word1 and 2) how to formalize it for corpus processing. The data is obtained from the Contemporary American English Corpus (COCA). In this corpus, F-word is proven to be highest in frequency as compared to its distribution across corpora. Corpus methodology is applied by sending queries to retrieve F-Words to COCA interface. Tokens combination surrounding F-words resulted in the phrase and clause unit accompanying F-words, which are significant cues to determine F-word polarity. The polarity is later proven to be not necessarily negative. I also designed a computational resource to allow the retrieval of F-words offline so that users might apply it to any digital text collections.


Keywords


F-words, Corpus, Information Retrieval, American English, Polarity

Full Text:

PDF


References

Allan, K., & Burridge, K. (2006). Forbidden Words: Taboo and The Censoring of Language. Cambridge: Cambridge University Press.

Annelie, A., & Erman, B. (2012). Recurrent word combinations in academic writing by native and non-native speakers of English: A lexical bundles approach. English for specific purposes 31.2, 81-92.

Brewster, S. (2013). Saying the ‘F word… in the nicest possible way’: augmentative communication and discourses of disability. Disability & Society, 28(1), 125-128.

Busthomi, Y. (2010). On Dirty Words: An Attempt to Intimate The Conundrum Of The Sin. Bahasa dan Seni, 1-9.

Chen, Y.-H., & Baker, P. (2010). Lexical Bundles In L1 And L2 Academic Writing. Language Learning & Technology (2), 30-49.

Chiorean, S. (2014). The Case of Fuck in Corpus of Contemporary American English (Thesis). Stockholm: Stockholm University Press.

Davies, M. (2008). American Corpus. Retrieved from The corpus of contemporary American English (COCA): 385 million words, 1990-present: http://www.americancorpus.org

Davies, M. (2010). The Corpus of Contemporary American English as the First Reliable Monitor Corpus of English. Language and Literary Computing, 447-464.

Fairman, C.-M. (2007). Fuck. Cardozo Law Review, 28(4), 1711-1772.

Firth, J. (1957). Papers in linguistics 1934–1951. London: Oxford University Press.

Harding, N., Ford, J., & Fotaki, M. (2013). Is the ‘F’- word still dirty? A past, present and future of/ for feminist and gender studies in Organization. Organization, 20(1), 51-65.

Hobbs, P. (2012). Fuck as a metaphor for male sexual aggression. Gender and Language, 7(2), 149- 176.

Howe, R. (2012). The Use of Fuck: A Sociolinguistic Approach to the Usage of Fuck in the BNC and Blog Autorship Corpus. Michigan: Michigan University Press.

Hughes, G. (2006). Encyclopedia of Swearing. New York: M.E Sharpe Inc.

Hymes, D.-H. (1962). The ethnography of Speaking. In T. G. (eds), Anthropology and Human Behaviour. Washington, D. C: Anthropology Society of Washington.

Jaworska, S., & Krishnamurthy, R. (2012). On the F word: A corpus-based analysis of the media representation of feminism in British and German press discourse, 1990–2009. Discourse & Society 23(4), 401-431.

Johnson, S. (2003). Samuel Johnson’s Dictionary. London: Walker & Company.

Kravis, N. (2013). Fuck Redux A Review And Commentary. Journal of the American PsychoanalyticA ssociation, 61(3), 527-534.

Leigh, M., & Lepine, M. (2005). Advanved Swearing Handbook. Sussex: Sumersdale Publisher.

Ludeling, A., & Kyto, M. (2009). Corpus Linguistics: An International Handbook. Berlin and New York: Walter de Gruyter.

Mc Enery, T., & Hardie, A. (2012). Corpus Linguistics: Method, Theory and Practice. Cambridge: Cambridge University Press.

McEnery, A., & Xiao, Z. (2004). Swearing in modern British English: the case of fuck in the BNC. Language and Literature 13(3), 235-268.

McEnery, A., Baker, J.-P., & Hardie, A. (2000). Assessing claims about language use with corpus data: Swearing and abuse. Language and Computers, 30, 45-56.

McEnery, T. (2006). Swearing in English. New York: Routledge.

Nadar, F., Wijana, I., Poedjosoedarmo, S., & Djawanai, S. (2005). Penolakan dalam Bahasa Inggris dan Bahasa Indonesia. Humaniora (17), 166-178.

Norrick, N.-R. (2012). Swearing in literary prose fiction and conversational narrative. Narrative Inquiry, 22(1), 24-49.

O’Keefe, A., & McCarthy, M. (2010). The Routledge Handbook of Corpus Linguistics. Abingdon: Routledge.

Olive, R., & Thorpe, H. (2011). Negotiating the ‘F-word’in the field: Doing feminist ethnography in action sport cultures. Sociology of Sport Journal 28(4), 421-440.

Pang, W. (2010). Lexical Bundles and the Construction of an Academic Voice:A Pedagogical Perspective. Asian EFL Journal, Vol 47.

Paumier, S. (2008). Unitex Manual. Paris: Universite Paris Est Marne La Valee & LADL.

Sheidlehower, J. (2009). The F Word. Oxford: Oxford University Press.

Sparrow, J. (1962). Regina V Penguin-Books-Ltd-An Undisclosed Element In The Case. Encounter 18(2), 35-43. 169



DOI: https://doi.org/10.22146/jh.8709

Article Metrics

Abstract views : 984 | views : 1058

Refbacks

  • There are currently no refbacks.




Copyright (c) 2016 Prihantoro Prihantoro

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.


About Us

Editorial Team

Journal History

Contact

Publish with Us

Guidelines

Submission

Register Account

Explore

Current Issue

Archives

Our Statistics


Humaniora (Online ISSN 2302-9629; SINTA-2 Indexed) is published by the Faculty of Cultural Sciences, Universitas Gadjah Mada. The layout and the content of this OJS 2 website are customized by and attributable to Moh. Masruhan and Yoga Adwidya, and are licensed under a Creative Commons Attribution-ShareAlike 4.0 International Licence (CC BY-SA).
web counter Humaniora's Stats