- About Us
- Services
- Stories
- Faisal Beg Algorithms to Advance Research in Medicine
- Yasutaka Furukawa Smart Building Technologies to Enhance Living Spaces and Create Opportunities
- Mo Chen AI to Create Safe and Practical Robotics
- Sheelagh Carpendale Understanding Data Through Interaction and Visualization
- Innovation to Improve 3D Navigation
- Voice AI is Helping Shoppers Make Better Decisions
- Geographic Information Science Can Help Better Track COVID-19
- Deep Learning to Inform Medical Diagnoses
- Protecting Killer Whales from Marine Traffic
- Using Big Data to Boost Athletic Performance
- Machine Reading for Literary Texts
- Finding a Cure for HIV with Big Data
- Linked Data for Women's History
- How Big Data Can Combat Fake News
- Algorithms for Safer Streets
- Discovering Wilde Data
- Deep Blue Data
- Big Data Meets Big Impact
- Previous Next Big Question Fund Projects
- Data Fellowships
- Using Data
- Upcoming Events
Machine Reading for Literary Texts
Machine Reading for Literary Texts
Project Team: Margaret Linley (English, 間眅埶AV), Oliver Schulte (Computing Science, 間眅埶AV), Maite Taboada (Linguistics, 間眅埶AV), Steven Bergner (Computing Science, 間眅埶AV)
For many researchers, much if not most information about their domain is available in unstructured format only. Examples include literary text, web pages, free form comments and reports. Restricting data analysis to structured data limits the potential of big data methods. The process of extracting structured information from unstructured data is called machine reading. Machine reading supports research in the digital humanities, such as distant reading approaches, which aim to find statistical patterns in collections of literary texts, track how ideas, genres, topics, and even moods and emotions circulate, and extract relationship networks of characters. This project studies the Lake District travel books hosted by 間眅埶AVs special collections. The team will investigate different machine reading systems for the books themselves, as well as webpages that describe the literary, historical, geographical, and cultural context of the travel books. This project will build machine reading expertise at Big Data Initiative and 間眅埶AV.