Patent title: System and Method for Creating and Maintaining a Database of Disambiguated Entity Mentions and Relations from a Corpus of Electronic Documents
Inventors: Michael A. Woytowitz, Marshall Wells Hawks
Application number: US132092
Filing date: 2011-08-09
Publication number: US20120197862A1
Publication date: 2012-08-02
Abstract: Method and apparatus for creating an electronic database of disambiguated entity mentions and relations from a corpus of electronic documents. The invention automatically extracts from the corpus of electronic documents mentions about entities (e.g., references to people, organizations or places), parses the entity mentions into “mention objects,” and executes a series of grouping, comparison and hierarchical fuzzy object clustering algorithms to cluster together in an electronic database all of the mention objects referring to the same entity and all of the mention objects (e.g., “people”) associated with each other by a relationship (e.g., “co-authors” or “family members”). The resulting electronic database of disambiguated entity mentions and relations, which may comprise, for example, an XML document, a relational database or hierarchical database, is structured to permit useful recordation, access, review and display of all of the mentions and relations associated with a particular entity or collection of entities.
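The clustering step described in the abstract can be illustrated with a minimal sketch. This is not the patented method: the abstract does not specify the similarity measure, the thresholds, or the layout of a mention object, so the `Mention` class, the `similarity` function, the `cluster_mentions` function and the 0.8 threshold below are all assumptions chosen for illustration. The sketch swaps in plain string similarity from Python's standard `difflib` module and a single-link merge in place of the hierarchical fuzzy object clustering the abstract names.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass(frozen=True)
class Mention:
    """Hypothetical 'mention object': one reference to an entity in one document."""
    doc_id: str
    entity_type: str  # e.g. "person", "organization", "place"
    text: str         # surface form as it appears in the document


def similarity(a: Mention, b: Mention) -> float:
    """Fuzzy string similarity in [0, 1]; mentions of different types never match."""
    if a.entity_type != b.entity_type:
        return 0.0
    return SequenceMatcher(None, a.text.lower(), b.text.lower()).ratio()


def cluster_mentions(mentions, threshold=0.8):
    """Single-link clustering: merge any two mentions whose similarity >= threshold.

    The threshold is an illustrative assumption, not a value from the patent.
    """
    parent = list(range(len(mentions)))

    def find(i):
        # Follow parent links to the cluster root, compressing the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Compare every pair of mentions and union the ones that look alike.
    for i in range(len(mentions)):
        for j in range(i + 1, len(mentions)):
            if similarity(mentions[i], mentions[j]) >= threshold:
                parent[find(j)] = find(i)

    # Collect mentions by cluster root.
    clusters = {}
    for i, mention in enumerate(mentions):
        clusters.setdefault(find(i), []).append(mention)
    return list(clusters.values())


mentions = [
    Mention("doc1", "person", "Michael Woytowitz"),
    Mention("doc2", "person", "Michael A. Woytowitz"),
    Mention("doc3", "person", "Marshall Hawks"),
    Mention("doc4", "organization", "Michael Woytowitz"),
]
clusters = cluster_mentions(mentions)
```

In this sketch the two person mentions of Woytowitz fall into one cluster, while the organization mention with the same surface text stays separate because the types differ. A real system would likely compare more attributes than the surface string and would also need a separate step for relations such as co-authorship, which this sketch does not attempt.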
Applicants: Michael A. Woytowitz, Marshall Wells Hawks
Addresses: Freeland, MD, US; Upperco, MD, US
Nationality: US, US