Semantic Indexing of Unstructured Documents Using Taxonomies and Ontologies

Posted on August 27, 2013. Filed under: Uncategorized |

See on Scoop.itTaxonomy for an Enterprise

“Life Science and Healthcare organizations use RDF/SKOS/OWL based vocabularies, thesauri, taxonomies and ontologies to organize enterprise knowledge. There are many ways to use these technologies but one that is gaining momentum is to semantically index unstructured documents through ontologies and taxonomies.

In this talk we will demonstrate two projects where we use a combination of SKOS/OWL based taxonomies and ontologies, entity extraction, fast text search, and Graph Search to create a semantic retrieval engine for unstructured documents.

The first project organized all science related artifacts in Malaysia through a taxonomy of scientific concepts. It indexed all papers, people, patents, organizations, research grants, etc, etc, and created a user friendly taxonomy browser to quickly find relevant information, such as, “How much research funding has been spent on a certain subject over the last 3 years and how many patents resulted from this research”.

The second project discusses a large socio-economic content publisher that has millions of documents in at least eight different languages. Reusing documents for new publications was a painful process given that keyword search and LSI techniques were mostly inadequate to find the document fragments that were needed. Fortunately the organization had begun developing a large SKOS based taxonomy that linked common concepts to various preferential and alternative labels in many languages. We used this taxonomy to index millions of document fragments and we’ll show how we can perform relevancy search and retrieval based on taxonomic concepts.”

See on www.youtube.com

Make a Comment

Leave a Reply

Please log in using one of these methods to post your comment:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

    About

    Knowledge Management | Library Management | Sharepoint | ITIL/CMDB Professional, Bangalore, India.

    RSS

    Subscribe Via RSS

    • Subscribe with Bloglines
    • Add your feed to Newsburst from CNET News.com
    • Subscribe in Google Reader
    • Add to My Yahoo!
    • Subscribe in NewsGator Online
    • The latest comments to all posts in RSS

    Meta

Liked it here?
Why not try sites on the blogroll...

%d bloggers like this: