Quick facts
- Paris
Apply by: 2025-01-31
Research Engineer in Data Production, Processing, and Analysis
Organisation/Company: SORBONNE UNIVERSITE
Department: Faculté des Lettres
Research Field: Literature » Literary criticism; Language sciences » Philology
Researcher Profile: Recognised Researcher (R2)
Positions: Research Support Positions
Country: France
Application Deadline: 28 Jan 2025 - 12:00 (Europe/Paris)
Type of Contract: Temporary
Job Status: Part-time
Offer Starting Date: 3 Feb 2025
Is the job funded through the EU Research Framework Programme? Horizon Europe - ERC
Reference Number: 101141778
Is the Job related to staff position within a Research Infrastructure? No
Offer Description
Digital Processing of Grammatical Quotations in the Corpus of Latin Grammarians
Requirements
Research Field: Literature » European literature
Education Level: PhD or equivalent
Skills/Qualifications
- Knowledge of the organization of research and higher education in France and abroad
- Knowledge of ancient languages (Latin, Greek) and their digital processing
- Knowledge of XML and TEI standards
- Proven knowledge of data science, natural language processing, and applied artificial intelligence
Under the direction of the Principal Investigator, the Research Engineer will identify and link literary and grammatical citations across the entire corpus of source texts. The literary citations in the normative manuals and works on meter from the CGL online corpus (Garcea & Lomanto ed. 2022) have already been identified, but the grammatical citations have not. Neither grammatical nor literary citations in the commentaries and glossographic works have ever been analyzed as a whole in the field of digital humanities, and their exact number remains unknown: literary citations in Festus are estimated at 1,560 (North 2007: 49), in Servius at around 1,500 (Pellizzari 2003: 222-245), and in Nonius at 7,176 (Cadoni 1987: 15). This task presents a major methodological challenge, because both grammatical and literary citations must be collected so that each citation is presented in all its forms and contexts.
The XML-TEI structuring will be carried out using the Teinte software developed by the Sorbonne team ObTIC, a tool that enables the mass and automatic conversion of OCR documents into XML-TEI format. Annotation will be performed using the open-source application INCEpTION, and the resulting reference sub-corpora will be adapted for use with machine learning systems to identify these citations in new texts.
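To give a concrete idea of what collecting each citation "in all its forms and contexts" could involve once the texts are in XML-TEI, here is a minimal Python sketch. It is an illustration under stated assumptions, not the project's actual pipeline: the sample document, the use of `<div type="work" n="...">` to delimit source texts, the `type` attribute on `<quote>`, and the helper names `normalize` and `collect_citations` are all hypothetical, and the output of Teinte or INCEpTION may be structured differently.

```python
import re
import unicodedata
import xml.etree.ElementTree as ET
from collections import defaultdict

TEI_NS = {"tei": "http://www.tei-c.org/ns/1.0"}

# Hypothetical sample: two source texts quoting the same verse with
# different spelling and punctuation, plus one grammatical citation.
SAMPLE_TEI = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <text><body>
    <div type="work" n="grammarian-A">
      <p>ut Vergilius: <quote type="literary">Arma virumque cano</quote></p>
    </div>
    <div type="work" n="grammarian-B">
      <p>sic: <quote type="literary">arma uirumque cano,</quote></p>
      <p>et: <quote type="grammatical">nomen est pars orationis</quote></p>
    </div>
  </body></text>
</TEI>"""


def normalize(text):
    """Reduce a citation to a comparison key (assumed normalization rules):
    lowercase, merge u/v and i/j spellings, drop punctuation, squeeze spaces."""
    text = unicodedata.normalize("NFC", text).lower()
    text = text.replace("v", "u").replace("j", "i")
    text = re.sub(r"[^\w\s]", "", text)
    return " ".join(text.split())


def collect_citations(tei_xml):
    """Group every <quote> by its normalized form, keeping for each
    occurrence the source text, the citation type, and the surface form."""
    root = ET.fromstring(tei_xml)
    groups = defaultdict(list)
    for div in root.iterfind(".//tei:div[@type='work']", TEI_NS):
        work = div.get("n")
        for quote in div.iterfind(".//tei:quote", TEI_NS):
            surface = "".join(quote.itertext()).strip()
            groups[normalize(surface)].append(
                {"work": work, "type": quote.get("type"), "surface": surface}
            )
    return dict(groups)


if __name__ == "__main__":
    for key, occurrences in collect_citations(SAMPLE_TEI).items():
        print(key, "->", [(o["work"], o["surface"]) for o in occurrences])
```

Grouping on a normalized key is only one possible way to link variant forms of the same citation; a real corpus would likely need fuzzier matching for truncated, paraphrased, or textually divergent quotations.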
Languages: Latin (Level: Good)
Languages: Greek (Level: Basic)
Research Field: Literature » European literature
Years of Research Experience: 4 - 10
Additional Information
Selection process
Required Documents:
- A cover letter (in French and/or English) addressed to the Principal Investigator, explaining the applicant's interest in this research and the skills they intend to bring to it.
- A detailed CV.
- A copy of the document certifying the required level of education.
Selection Criteria:
- A PhD in Classical Studies, Linguistics of Ancient Languages, Ancient Languages and Literatures, or Ancient Philosophy.
- Proven skills in computational linguistics and digital humanities.
- An interest in the history of ancient linguistic thought.
- Proficiency in both French and English.
- Strong organizational skills.
Selection Process:
The required documents must be sent to the Principal Investigator, Alessandro Garcea, at .
Shortlisted candidates will be invited for an interview, which can be conducted in person in Paris or remotely.