BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Memento EPFL//
BEGIN:VEVENT
SUMMARY:Quantitative approaches to historical texts: should you care about
  OCR? - Talk by Dr. Simon Hengchen\, University of Gothenburg
DTSTART:20201118T121500
DTEND:20201118T131500
DTSTAMP:20260504T064600Z
UID:a871182cfb8708338e50c9ace61feffb47b6e99680cac207e0d3db68
CATEGORIES:Conferences - Seminars
DESCRIPTION:Dr. Simon Hengchen\nAbstract: \nQuantitative methods for histo
 rical text analysis offer exciting opportunities for researchers intereste
 d in gaining new insights into long studied texts. However\, the methodolo
 gical underpinnings of these methods remains under-explored. In the first 
 part of the talk I will show and discuss\, through the use of a case study
 \, the effect the OCR process has on a range of quantitative text analyses
 .\nIn the second part of the talk\, I will present a novel and totally uns
 upervised OCR post-correction method on the same dataset\, as well as its 
 most recent evolution on a highly-inflected language. \n\nReferences:\nH
 ämäläinen\, M. and Hengchen\, S.\, 2019. From the Paft to the Fiiture: 
 a Fully Automatic NMT and Word Embeddings Method for OCR Post-Correction. 
 In Recent Advances in Natural Language Processing (pp. 432-437). INCOMA.\n
 Hill\, M.J. and Hengchen\, S.\, 2019. Quantifying the impact of dirty OCR 
 on historical text analysis: Eighteenth Century Collections Online as a ca
 se study. Digital Scholarship in the Humanities\, 34(4)\, pp.825-843.\n \
 nBio:\nSimon Hengchen is a researcher in NLP at the University of Gothenbu
 rg\, where he works within the Language Change project. His main research 
 focus is lexical semantic change in multilingual\, unstructured\, OCRed\, 
 historical textual data\, but he is also interested in NLP for DH. Simon i
 s also a part-time lecturer in DH at the University of Geneva.\n\n\nDH Res
 earch Seminar \nThe DH Research Seminar is a series of talks organised by 
 the Digital Humanities Institute given by researchers from a wide range of
  backgrounds and aiming at presenting the vast array of subjects covered b
 y Digital Humanities.\n\nDue to sanitary restrictions\, the DH Research Se
 minar will be given exclusively on-line during the 2020 Fall semester.\n\n
 Be sure to join\, listen to the talk and participate in the Q&A session at
  the end of the presentation.\n 
LOCATION:By Zoom https://epfl.zoom.us/j/85227198760 https://epfl.zoom.us/j
 /85227198760
STATUS:CONFIRMED
END:VEVENT
END:VCALENDAR
