spacer spacer spacer spacer spacer
spacer UMBC ebiquity spacer spacer spacer spacer spacer
spacer spacer spacer spacer spacer spacer spacer spacer spacer spacer spacer
spacer spacer spacer spacer spacer
spacer spacer spacer spacer spacer
spacer
Computing word and phrase similarity
spacer
« Public tutorials on high performance computing research and technologies
What will replace Big Data as a hot buzzword? »

Computing word and phrase similarity

Tim Finin, 7:32am 10 January 2013
Tweet

spacer

Computing semantic similarity between words and phrases has important applications in natural language processing, information retrieval, and artificial intelligence. There are two prevailing approaches to computing word similarity, based on either using of a thesaurus (e.g., WordNet) or statistics from a large corpus. We provide a hybrid approach combining the two methods that is demonstrated on a web site through two services: one that returns a similarity score for two words or phrases and another that takes a word and shows a ranked list of the most similar words.

Our statistical method is based on distributional similarity and Latent Semantic Analysis. We further complement it with semantic relations extracted from WordNet. The whole process is automatic and can be trained using different corpora. We assume the semantics of a phrase is compositional on its component words and apply an algorithm to compute similarity between two phrases using word similarity.

The algorithms, implementation and data for this work were developed by Lushan Han as part of his research on developing easier ways to query linked open data collections. It was supported by grants from AFOSR (FA9550-08-1-0265), NSF (IIS-1250627) and a give from Microsoft. Contact umbcsim at cs.umbc.edu for more information.


Categories: AI, Machine Learning, NLP, Semantic Web Comments: Comments Off

Comments are closed.

spacer
spacer Ebiquity Blog
  Home | Archive | Login | Feed spacer register -->


spacer Ebiquity Recent Posts
  • Wikidata article in CACM
  • Responsive design with Twitter Bootstrap: a tutorial and demonstration
  • Infoboxer: using statistical semantic knowledge to help create Wikipedia infoboxes
  • Rafiki: A Semantic and Collaborative Approach to Community Health-Care in Underserved Areas
  • Taming Wild Big Data

  • spacer Ebiquity rdfs:seeAlso
  • schema blog: Introducing 'Role'
  • Google Custom Search
  • Apache Any23: Anything To Triples - Service 1.1-SNAPSHOT (UNKNOWN@r${buildNumber}; 2014-05-18 21:42:16 0000)
  • VOWL: Visual Notation for OWL Ontologies
  • An introduction to Semantic Web and Linked Data
  • Graphical Ontology Editor ยท OWLGrEd
  • schema blog
  • Landmark Steps to Liberate Open Data

  • spacer Ebiquity on Flickr

    spacer Ebiquity community
  • AISL
  • Assured Information Sharing
  • Harry Chen thinks aloud
  • Journal of Web Semantics
  • Search, Spam, Social Media
  • UMBC CSEE
  • UMBC GAIM

  • spacer Ebiquity tags
    AI Games Policy workshop RDF provenance advertising Journal of Web Semantics data social network RDFa social networking obama darpa LOD Yahoo iswc IBM voting JWS cloud computing spam Semantic Web Microsoft Twitter Python Google Social media linked data Facebook
    spacer

    UMBC  home · contact · about · site map · legal · privacy · ©1999-2014 ebiquity · design/code ©2003-2014 Filip Perich · XG

    spacer
    gipoco.com is neither affiliated with the authors of this page nor responsible for its contents. This is a safe-cache copy of the original web site.