Web Intelligence Technologies

Natural Language Processing Group, Department of Computer Science, University of Sheffield

Tools

Armadillo

A tool for semantically mining large collections of documents.

Fabio Ciravegna, Sam Chapman, Alexiei Dingli, Yorick Wilks: Learning to Harvest Information for the Semantic Web, in Proceedings of the 1st European Semantic Web Symposium, Heraklion, Greece, May 10-12, 2004.

T-Rex

Technology for adaptive information extraction (IE) from texts and for document classification. It also provides infrastructure for representing documents, including non-textual ones.

Amilcare

Technology for adaptive IE from texts.

Fabio Ciravegna and Yorick Wilks: Designing Adaptive Information Extraction for the Semantic Web in Amilcare, in S. Handschuh and S. Staab (eds), Annotation for the Semantic Web, in the Series Frontiers in Artificial Intelligence and Applications by IOS Press, Amsterdam, 2003.

Melita

Text Annotation Tool for the Semantic Web assisted by Adaptive IE.

Fabio Ciravegna, Alexiei Dingli, Daniela Petrelli and Yorick Wilks: User-System Cooperation in Document Annotation based on Information Extraction, in Asuncion Gomez-Perez, V. Richard Benjamins (eds.): Knowledge Engineering and Knowledge Management (Ontologies and the Semantic Web), Proceedings of the 13th International Conference on Knowledge Engineering and Knowledge Management (EKAW02), 1-4 October 2002, Sigüenza (Spain), Lecture Notes in Artificial Intelligence 2473, Springer Verlag.

Knowledge Integration Methodologies

SimMetrics

SimMetrics is a Java and C# .NET library of similarity metrics. It includes edit distances (Levenshtein, Gotoh, Smith-Waterman, etc.) as well as other measures (e.g. Soundex). Every metric returns a normalised measure in addition to the standard output score of the underlying similarity algorithm.
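To illustrate the difference between a raw score and a normalised measure, here is a minimal Python sketch of Levenshtein distance with one common normalisation (one minus the distance divided by the longer string's length). This is not the SimMetrics API, which is in Java and C#. The function names are hypothetical, and the library's own normalisation may differ.

```python
def levenshtein(a: str, b: str) -> int:
    """Raw edit distance: the minimum number of single-character
    insertions, deletions and substitutions turning a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def normalised_similarity(a: str, b: str) -> float:
    """Assumed normalisation: 1.0 for identical strings, 0.0 when
    every character of the longer string has to change."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0  # two empty strings are treated as identical
    return 1.0 - levenshtein(a, b) / longest
```

A normalised value in the range 0 to 1 makes scores from different metrics comparable, which a raw edit count does not.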

Knowledge Sharing and Reuse

AKTiveDoc

Document editing and browsing tool for the Semantic Web.

Vitaveska Lanfranchi, Fabio Ciravegna, Daniela Petrelli: Semantic Web-based Document: Editing and Browsing in AktiveDoc, Proceedings of the 2nd European Semantic Web Conference, Heraklion, Greece, May 29-June 1, 2005.

AKTiveMedia

Tool for user-centred semi-automatic multimedia annotation. It also provides knowledge sharing capabilities for presenting semantic web annotated documents and search results.

Ajay Chakravarthy, Vita Lanfranchi and Fabio Ciravegna: Cross-media Document Annotation and Enrichment, SAAW2006 - 1st Semantic Authoring and Annotation Workshop, The 5th International Semantic Web Conference (ISWC2006), Athens, GA, USA, Monday, November 6th 2006.

K-Search

Search tool for distributed repositories (mainly RDF, but also traditional databases) based on Hybrid Search Technology. Patent is pending.

Vitaveska Lanfranchi, Ravish Bhagdev, Sam Chapman, Fabio Ciravegna, Daniela Petrelli: Extracting and Searching Knowledge for the Aerospace Industry, in Proceedings of the 1st European Semantic Technology Conference, Vienna, May 31-June 1, 2007.

AktiveForm

Ontology-based environment for developing and releasing human-centred knowledge capture applications. An ontology drives the development of the knowledge capture strategy, which is realised as a set of HTML forms. At application time, the forms capture knowledge as RDF statements. The captured knowledge is then passed to search mechanisms (e.g. K-Search) or used to create virtual documents.
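The step from a submitted form to RDF statements can be sketched as follows. This is only an illustration of the idea, not AktiveForm's implementation. The function name, the field-to-property mapping, and the `http://example.org/` URIs are all hypothetical, and values are serialised as plain N-Triples string literals.

```python
def form_to_ntriples(subject_uri, form_fields, field_to_property):
    """Turn submitted form values into N-Triples statements.

    field_to_property stands in for the ontology-derived mapping from
    form field names to property URIs (an assumption of this sketch).
    """
    statements = []
    for field, value in form_fields.items():
        property_uri = field_to_property.get(field)
        if property_uri is None or value == "":
            continue  # skip unmapped fields and empty answers
        # Escape characters that are not allowed raw in a literal.
        escaped = (value.replace("\\", "\\\\")
                        .replace('"', '\\"')
                        .replace("\n", "\\n"))
        statements.append(f'<{subject_uri}> <{property_uri}> "{escaped}" .')
    return statements


# Hypothetical form submission and mapping.
mapping = {
    "component": "http://example.org/onto#affectsComponent",
    "summary": "http://example.org/onto#summary",
}
submitted = {"component": "fan blade", "summary": 'Crack near "root"', "notes": ""}
triples = form_to_ntriples("http://example.org/report/1", submitted, mapping)
```

Each resulting line is one subject-predicate-object statement that an RDF store or search component could load.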

Runes

Runes is a plugin-based framework that reverses the way of thinking about data processing: specify data accesses, and Runes will automatically choose an efficient representation for the data and execute the plugins to integrate it from different sources.

Reference: J. Iria and F. Ciravegna. A Methodology and Tool for Representing Language Resources for Information Extraction. In Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006), Genoa, 24-26 May 2006.

Saxon

Environment for IE rule writing based on finite state automata. It works on top of Runestone.
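As a rough illustration of what an IE rule expressed as a finite state automaton can look like, the Python sketch below matches a title token followed by one or more capitalised tokens. It does not reflect Saxon's actual rule language or its use of Runestone. The token classes, states, and the rule itself are hypothetical.

```python
TITLES = {"Dr", "Prof", "Mr", "Ms"}  # hypothetical lexicon


def token_class(token):
    """Map a token to the symbol the automaton reads."""
    if token in TITLES:
        return "TITLE"
    if token[:1].isupper():
        return "CAP"
    return "OTHER"


# Transition table for the rule: TITLE CAP+
TRANSITIONS = {
    ("start", "TITLE"): "title",
    ("title", "CAP"): "name",
    ("name", "CAP"): "name",
}
ACCEPTING = {"name"}


def extract(tokens):
    """Return the longest non-overlapping token spans the automaton accepts."""
    matches = []
    i = 0
    while i < len(tokens):
        state, j, last_accept = "start", i, None
        while j < len(tokens):
            next_state = TRANSITIONS.get((state, token_class(tokens[j])))
            if next_state is None:
                break  # no transition: stop reading from this start
            state = next_state
            j += 1
            if state in ACCEPTING:
                last_accept = j  # remember the longest accepted end
        if last_accept is not None:
            matches.append(tokens[i:last_accept])
            i = last_accept
        else:
            i += 1
    return matches


people = extract("The talk by Dr Jane Smith starts at noon".split())
```

Here `people` holds the single span `["Dr", "Jane", "Smith"]`. The sentence-initial "The" is capitalised but is not matched, because the automaton has no transition from the start state on a capitalised token.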