Departamentos

Estudios Ingleses. Plan 2022

Grado y Doble Grado. Curso 2024/2025.

LINGÜÍSTICA COMPUTACIONAL Y DE CORPUS - 806504

Curso Académico 2024-25

Datos Generales

SINOPSIS

COMPETENCIAS

Generales
Curso Académico 2023-24
LINGÜÍSTICA COMPUTACIONAL Y DE CORPUS (LENGUA INGLESA)
Ficha DocenteFecha ficha docente: LINGÜÍSTICA COMPUTACIONAL Y DE CORPUS (LENGUA INGLESA) Página 2 de 3
COMPETENCIAS:
Generales
CG 1- Conocimiento instrumental de la lengua inglesa a un nivel elevado.
CG 2 - Conocimiento de las disciplinas del campo de la lengua inglesa.
CG 3 - Capacidad para utilizar herramientas informáticas y presentar en público productos, ideas e informes, todo ello en lengua
inglesa.
CG 4 - Capacidad para manejar, programar y comprender programas informáticos, en especial aquellos relacionados con la
gestión de bases de datos textuales, tanto en lengua inglesa como en española.
CG 5 - Capacidad para acometer el desarollo de trabajos relacionados con la gestión de bases de datos textuales, y el tratamiento
informatizado de dichos datos, con un alto grado de autonomía.
CG 6 - Desarrollo de una actitud reflexiva y crítica sobre la perspectiva computacional aplicada a la lengua inglesa.
Específicas
CE 21: El alumno será capaz de desenvolverse con cierta soltura en el manejo de herramientas informáticas y de bases de datos
textuales.

ACTIVIDADES DOCENTES

Clases teóricas
The course will combine theoretical aspects with hands-on practical activities involving computational and corpus analysis tools.
Clases prácticas
Hands-on practical activities involving computational and corpus analysis tools.

Presenciales

6

No presenciales

4

Semestre

1

Breve descriptor:

This course offers an introduction to the computational perspective in language studies as well as to techniques and tools used in the field Corpus Linguistics. Corpora, i.e.,
large archives of linguistic data (such as texts and speech transcriptions) that can be collected and analyzed using computational tools and methods. Through a combination
of lectures, demonstrations, and hands-on exercises, this course will give students an introduction to the skills necessary for computer-aided text manipulation.

The course is designed to increase awareness of computational and corpus tools and their applications for linguistic research. It provides ways to learn about various tools to at least partially automate or accelerate linguistic analysis, providing students with a greater understanding of how and when to use empirical approaches to linguistic analysis. A large emphasis of the course is in how to deal with large amounts of language data and to
understand practical issues in dealing with corpora, annotation and multi-lingual data. It covers computer methods for doing linguistics with on-line corpora.

Requisitos

Advanced computational skills to undertake the use and programming of computer tools and analyses. Students without adequate computer skills (including some basic programming skills) and in the first courses of the English Studies Degree should NOT register for this course.

Objetivos

At the end of the course, students will have learnt basic aspects of the computational perspective on language, focusing on the preprocessing, markup and annotation of English computer corpora, as well as on the handling of corpus analysis and query tools for the extraction of linguistic information from machine-readable datasets.

Contenido

Unit 1: Introductory Issues: Definitions and approaches. What is a corpus? What is corpus
linguistics?
Unit 2: Corpus types and corpus design
Unit 3: Corpus data: processing and compilation
Unit 4: Corpus querying (concordancing)
Unit 5: Querying mega corpora with online interfaces

Evaluación

The final grade will be calculated according to the percentages below. The participation
grade includes attending class regularly and participating actively. Other in-class
activities (exercises, group discussions, written questions, etc.) will also count towards
your participation grade. There will be one final project assignment which will be graded
and computed as 60% of the final grade.

Participation and activities 40%
Assignments 60%

Bibliografía

McEnery, T. and A. Hardie (2012). Corpus Linguistics: Method, Theory and Practice. Cambrdige University
Press.
McEnery, McEnery, T., Xiao, R. and Tono, Y. (2006) Corpus based language studies. London: Routledge.
Martin Wynne (ed.). 2005. Developing Linguistic Corpora: A Guide to Good Practice. Oxbow books.
Biber, Douglas. 1993. Representativeness in corpus design. Literary and Linguistic Computing8(4). 243–
257.
Fillmore, Charles J. 1992. 'Corpus linguistics' or 'computer-aided armchair linguistics'. In Jan Svartvik
(ed.), Directions in Corpus Linguistics. Proceedings of Nobel Symposium 82, Stockholm, 4-8 August 1991,
35–60. Berlin and New York: Mouton de Gruyter.
Hunston, Susan. 2008. Collection strategies and design decisions. In Anke Lüdeling & Merja Kytö
(eds.), Corpus Linguistics. An International Handbook, 154–168. Berlin: De Gruyter.

Otra información relevante

Optional readings:
Biber, Douglas & James K. Jones. 2009. Quantitative methods in corpus linguistics. In Anke Lüdeling
& Merja Kytö (eds.), Corpus Linguistics. An International Handbook. Vol. 2, 1286–1304. Berlin:
Mouton de Gruyter.
Kilgarriff, Adam. 2005. Language is never, ever, ever, random. Corpus Linguistics and Linguistic
Theory 1(2). 263–275.
Kübler, Sandra & Heike Zinsmeister. 2015. Corpus Linguistics and Linguistically Annotated Corpora.
London: Bloomsbury.
McEnery, Tony, Richard Xiao & Yukio Tono. 2006. Corpus-Based Language Studies: An Advanced
Resource Book. (Routledge Applied Linguistics.) London and New York: Routledge. Sampson, Geoffrey.
2013. The Empirical Trend. Ten Years on. International Journal of Corpus Linguistics 18(2), 281–289.
Santorini, Beatrice. 1990. Part-of-Speech Tagging Guidelines for the Penn Treebank Project(3rd
Revision). University of Pennsylvania, Technical Report.
Weisser, Martin. 2016. Practical Corpus Linguistics: An Introduction to Corpus-Based Language
Analysis. Oxford: Wiley Blackwell.
Zeldes, Amir. 2018. Multilayer Corpus Studies. (Routledge Advances in Corpus Linguistics 22.)
London: Routledge

Estructura

MódulosMaterias
No existen datos de módulos o materias para esta asignatura.

Grupos

Clases teóricas y/o prácticas
GrupoPeriodosHorariosAulaProfesor
Grupo A12/09/2024 - 13/12/2024JUEVES 12:30 - 14:30A-LAB 011MARIA JULIA LAVID LOPEZ
VIERNES 12:30 - 14:30A-LAB 011MARIA JULIA LAVID LOPEZ
Grupo T27/01/2025 - 09/05/2025JUEVES 15:00 - 17:00A-LAB 007LARA MORATON GUTIERREZ
VIERNES 15:00 - 17:00A-LAB 007LARA MORATON GUTIERREZ