Stalker: overcoming linguistic barriers in open source intelligence
Federico Neri, Paolo Geraci and Massimo Pettoni
International Journal of Networking and Virtual Organisations, 2011, vol. 8, issue 1/2, 37-51
Abstract:
The revolution in information technology is making open sources more accessible, ubiquitous and valuable. In recent years, the international intelligence communities have seen open sources become increasingly easy and cheap to acquire. However, up to 80% of electronic data is textual, and the most valuable information is often hidden and encoded in pages that are neither structured nor classified. The process of accessing all these raw data, heterogeneous in terms of source and language, and transforming them into information is therefore strongly linked to automatic textual analysis and synthesis, which in turn depend heavily on the ability to master the problems of multilinguality. This paper describes a content-enabling system that provides deep semantic search and information access to large quantities of distributed multimedia data for both experts and the general public. Stalker provides language-independent search and dynamic classification features for a broad range of data collected from several sources in a number of culturally diverse languages.
Keywords: focused crawling; natural language processing; morpho-syntactic analysis; functional role labelling; semantic analysis; supervised clustering; unsupervised clustering; linguistic barriers; Stalker; ICT; information technology; communications technology; intelligence communities; electronic data; hidden information; encoded information; unstructured information; unclassified information; raw data; heterogeneous data; textual analysis; textual synthesis; multilinguality; content-enabling systems; deep searching; distributed multimedia; language-independent searches; dynamic classification; cultural diversity; languages; networks; virtual organisations; web based organisations; online organisations; open source intelligence; web mining; world wide web; internet.
Date: 2011
Downloads:
http://www.inderscience.com/link.php?id=37160 (text/html)
Access to full text is restricted to subscribers.
Persistent link: https://EconPapers.repec.org/RePEc:ids:ijnvor:v:8:y:2011:i:1/2:p:37-51