News Search Using Discourse Analytics

dc.contributor.authorThompson, Paulen_US
dc.contributor.authorNawaz, Raheelen_US
dc.contributor.authorKorkontzelos, Ioannisen_US
dc.contributor.authorAnaniadou, Sophiaen_US
dc.contributor.editor-en_US
dc.date.accessioned2015-04-27T14:59:17Z
dc.date.available2015-04-27T14:59:17Z
dc.date.issued2013en_US
dc.description.abstractThe vast numbers of digitised documents containing historical data constitute a rich research data repository. However, computational methods and tools available to explore this data are still limited in functionality. Research on historical archives is still largely carried out manually. Text mining technologies offer novel methods to analyse digital content to identify various types of semantic information in these documents and to extract them as semantic metadata. Methods range from the automatic identification of named entities (e.g., people, places, organisations, etc.) to more sophisticated methods to extract information about events (e.g., births, deaths, arrests, etc.), allowing users to greatly increase the specificity of their search. We have created an extended model of event interpretation to allow searches to be refined based on various discourse facets, including isolating definite information about events from more speculative details, distinguishing positive and negative opinions and categorising events according to information source. We present ISHER as an example of a multi-faceted, semantically oriented system for searching news articles from the New York Times, dating back to 1987. We explain how our extended event interpretation model can enhance search capabilities in systems such as ISHER, including the identification of contrasting and contradictory information in news articles.en_US
dc.description.sectionheadersTrack 3, Full Papersen_US
dc.description.seriesinformationDigital Heritage International Congressen_US
dc.identifier.doi10.1109/DigitalHeritage.2013.6743801en_US
dc.identifier.urihttps://doi.org/10.1109/DigitalHeritage.2013.6743801en_US
dc.identifier.urihttps://diglib.eg.org:443/handle/10.1109/DigitalHeritage
dc.publisherThe Eurographics Associationen_US
dc.subject{Abstractsen_US
dc.subjectContexten_US
dc.subjectFilteringen_US
dc.subjectSemanticsen_US
dc.subjectText miningen_US
dc.subjectTrainingen_US
dc.subjectdiscourse analysisen_US
dc.subjectevent interpretationen_US
dc.subjecteventen_US
dc.subjectbased searchen_US
dc.subjecteventsen_US
dc.subjectsemantic metadataen_US
dc.subjectsocial historyen_US
dc.subjecttext mining}en_US
dc.titleNews Search Using Discourse Analyticsen_US
Files
Collections