English Studies (programme being phased out)
Undergraduate Programme. Academic Year 2024/2025.
COMPUTATIONAL AND CORPUS LINGUISTICS (ENGLISH LANGUAGE) - 802231
Academic Year 2024-25
General Information
- Degree programme: 0835 - DEGREE IN ENGLISH STUDIES (2009-10)
- Type: Elective
- ECTS: 6.0
SYNOPSIS
COMPETENCES
General
CG 2 - Knowledge of the disciplines within the field of the English language.
CG 3 - Ability to use computer tools and to present products, ideas and reports in public, all in English.
CG 4 - Ability to handle, program and understand computer programs, especially those related to the management of textual databases, in both English and Spanish.
CG 5 - Ability to undertake, with a high degree of autonomy, projects related to the management of textual databases and the computerized processing of such data.
CG 6 - Development of a reflective and critical attitude towards the computational perspective applied to the English language.
Specific
TEACHING ACTIVITIES
Lectures
Practical classes
Laboratories
Contact hours
Non-contact hours
Semester
Brief description:
This course offers an introduction to the computational perspective in language studies, as well as to the techniques and tools used in the field of Corpus Linguistics. Corpora are large archives of linguistic data (such as texts and speech transcriptions) that can be collected and analyzed using computational tools and methods. Through a combination of lectures, demonstrations, and hands-on exercises, this course will give students an introduction to the skills necessary for computer-aided text manipulation.
The course is designed to increase awareness of computational and corpus tools and their applications in linguistic research. It introduces various tools that at least partially automate or accelerate linguistic analysis, giving students a better understanding of how and when to use empirical approaches. The course places strong emphasis on handling large amounts of language data and on the practical issues involved in working with corpora, annotation and multilingual data. It also covers computational methods for doing linguistics with online corpora.
Prerequisites
Objectives
annotation of English computer corpora, as well as on the handling of corpus analysis and query tools for the extraction of linguistic information from machine-readable
datasets.
Content
Unit 1: Introductory Issues: Definitions and approaches. What is a corpus? What is corpus linguistics?
Unit 2: Corpus types and corpus design
Unit 3: Corpus data: processing and compilation
Unit 4: Corpus querying (concordancing)
Unit 5: Querying mega corpora with online interfaces
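As an illustration of what Unit 4 calls concordancing, the following is a minimal keyword-in-context (KWIC) sketch in Python. It is not part of the official course materials: the sample text, the function names `tokenize` and `kwic`, and the very simple tokenization rule are assumptions made for illustration only. Dedicated concordancing tools offer far more (sorting, regular-expression queries, annotation layers).

```python
import re


def tokenize(text):
    """Split text into lowercase word tokens (a deliberately simple rule)."""
    return re.findall(r"[a-z']+", text.lower())


def kwic(tokens, keyword, window=3):
    """Return (left context, keyword, right context) for every hit."""
    keyword = keyword.lower()
    lines = []
    for i, tok in enumerate(tokens):
        if tok == keyword:
            left = " ".join(tokens[max(0, i - window):i])
            right = " ".join(tokens[i + 1:i + 1 + window])
            lines.append((left, tok, right))
    return lines


# Hypothetical two-sentence "corpus", used only to demonstrate the idea.
sample = "The cat sat on the mat. The dog chased the cat around the garden."
tokens = tokenize(sample)

# Print each hit with the keyword aligned in a central column.
for left, kw, right in kwic(tokens, "cat", window=3):
    print(f"{left:>25}  [{kw}]  {right}")
```

Aligning the keyword in a central column is what makes recurring patterns to its left and right easy to spot when there are many hits.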
Assessment
The participation grade includes attending class regularly and participating actively. Other in-class activities (exercises, group discussions, written questions, etc.) will also count towards your participation grade. There will be one final project assignment, which will be graded and will count for 60% of the final grade.
Participation and activities 40%
Assignments 60%
Bibliography
Press.
McEnery, Tony, Richard Xiao & Yukio Tono. 2006. Corpus-Based Language Studies. London: Routledge.
Wynne, Martin (ed.). 2005. Developing Linguistic Corpora: A Guide to Good Practice. Oxbow Books.
Biber, Douglas. 1993. Representativeness in corpus design. Literary and Linguistic Computing 8(4). 243-257.
Fillmore, Charles J. 1992. 'Corpus linguistics' or 'computer-aided armchair linguistics'. In Jan Svartvik (ed.), Directions in Corpus Linguistics. Proceedings of Nobel Symposium 82, Stockholm, 4-8 August 1991, 35-60. Berlin and New York: Mouton de Gruyter.
Hunston, Susan. 2008. Collection strategies and design decisions. In Anke Lüdeling & Merja Kytö (eds.), Corpus Linguistics. An International Handbook, 154-168. Berlin: De Gruyter.
Other relevant information
Biber, Douglas & James K. Jones. 2009. Quantitative methods in corpus linguistics. In Anke Lüdeling & Merja Kytö (eds.), Corpus Linguistics. An International Handbook. Vol. 2, 1286-1304. Berlin: Mouton de Gruyter.
Kilgarriff, Adam. 2005. Language is never, ever, ever, random. Corpus Linguistics and Linguistic Theory 1(2). 263-275.
Kübler, Sandra & Heike Zinsmeister. 2015. Corpus Linguistics and Linguistically Annotated Corpora. London: Bloomsbury.
McEnery, Tony, Richard Xiao & Yukio Tono. 2006. Corpus-Based Language Studies: An Advanced Resource Book. (Routledge Applied Linguistics.) London and New York: Routledge.
Sampson, Geoffrey. 2013. The Empirical Trend. Ten Years on. International Journal of Corpus Linguistics 18(2). 281-289.
Santorini, Beatrice. 1990. Part-of-Speech Tagging Guidelines for the Penn Treebank Project (3rd Revision). University of Pennsylvania, Technical Report.
Weisser, Martin. 2016. Practical Corpus Linguistics: An Introduction to Corpus-Based Language Analysis. Oxford: Wiley Blackwell.
Zeldes, Amir. 2018. Multilayer Corpus Studies. (Routledge Advances in Corpus Linguistics 22.) London: Routledge.
Structure
Modules | Subjects |
---|---|
ADVANCED STUDY OF ENGLISH LANGUAGE AND LINGUISTICS | APPLIED ENGLISH LINGUISTICS |
Groups
Theoretical and/or practical classes
Group | Period | Schedule | Room | Lecturer |
---|---|---|---|---|
Group A | 12/09/2024 - 13/12/2024 | THURSDAY 12:30 - 14:30 | A-LAB 011 | MARIA JULIA LAVID LOPEZ |
Group A | 12/09/2024 - 13/12/2024 | FRIDAY 12:30 - 14:30 | A-LAB 011 | MARIA JULIA LAVID LOPEZ |
Group T | 27/01/2025 - 09/05/2025 | THURSDAY 15:00 - 17:00 | A-LAB 007 | LARA MORATON GUTIERREZ |
Group T | 27/01/2025 - 09/05/2025 | FRIDAY 15:00 - 17:00 | A-LAB 007 | LARA MORATON GUTIERREZ |