CAMBRIDGE, Mass., Feb. 21 / -- Teragram, the leading provider of multilingual natural language processing technologies, today announced that its proprietary categorization software is being used to enable auto-categorization of content on the Homeland Security Digital Library (HSDL) at http://www.hsdl.org/.
The HSDL is an important, first-of-its-kind national security resource whose mission is to make a vast number of electronic documents (in PDF, video and audio formats) featuring key strategy, policy and organizational research available to a wide user base of local, state and federal homeland security policy makers.
In the fall of 2004, the HSDL development team began licensing Teragram's categorization and concepts extraction software package, which was closely integrated with search engine technology.
Teragram's technology enables the HSDL to automatically categorize documents saved in multiple formats for easy search and retrieval. Sample topic areas include Law and Justice, Borders and Immigration, Infrastructure Protection, Terrorism and Society, Weapons and Weapons Systems, Emergency Management, and Public Health.
"We are proud to be partnering with the Homeland Security Digital Library to further their important work, enabling developers to consistently assign taxonomy terms to content for easier search and by automating the Herculean task of updating the library's overall taxonomy structure on a quarterly basis," said Dr. Yves Schabes, Ph.D., president and co-founder of Teragram Corporation.
How Teragram's Technology Works in the HSDL
Each day, HSDL content developers continuously add new documents in PDF, video and audio formats to the system, and Teragram's software ensures that these documents are properly categorized and available to approved users.
Teragram worked with the Library to develop a semantic, rules-based model for auto-categorization that allows the HSDL content development teams to manage rapidly growing content, providing a far greater level of recall and precision in portal browse and search applications. In addition, as national discourse shifts and document topics change, content managers can utilize reports generated by the auto-categorizer to determine whether documents are being categorized as expected. Any changes needed, such as adding new terminology or making rules more inclusive or exclusive, can then be made instantly by the Library's taxonomy developers.
Teragram's rapid categorization technology also adds flexibility to the HSDL's new system. Together, the system enables full-text and metadata searchability and metadata-determined relevancy across the entire Library collection. In addition, the Library's Web interface utilizes Teragram's categorization engine to build filtered, context-specific search navigation menus in its search and browse functions.
"Teragram's technology has helped bring HSDL to where it is today," said Thomas M. Mastre, HSDL Project Manager.
About the Homeland Security Digital Library
Launched in September 2005, The Homeland Security Digital Library (HSDL) is the primary online research tool for the faculty and students of the Center for Homeland Defense and Security (CHDS), operating out of the Naval Postgraduate School and sponsored by the Department of Homeland Security's (DHS) Office of Grants and Training (OG&T).
The HSDL's mission is to make electronic documents featuring key strategy, policy and advanced research available to a wide user base of local, state and federal homeland security policy makers. The scope of the project is vast -- and includes capturing the current debate on specific homeland security topics ranging from disaster management to border security, to epidemiology, to intergovernmental relations, to weapons and technology.
Teragram Corporation is the market leader in multilingual natural language processing technologies that use the meaning of text to distill relevant information from vast amounts of data. Founded in 1997 by innovators in the field of computational linguistics, Teragram alone offers the speed, accuracy and global language support that customers and partners demand to retrieve and organize growing volumes of digital information. Teragram helps customers perform more efficient searches and better organize information in more than 30 languages, enabling them to reach new markets and make better decisions. Teragram serves customers across the publishing, pharmaceutical, telecommunications and financial industries, including Ariba, Ask Jeeves, Associated Press, CNN, Factiva, FAST Search & Transfer, Forbes.com, InfoSpace, Kofax, Naval Post Graduate School, NYTimes Digital, OneSource, Reed Business Information, Ricoh, Sony, Verity, WashingtonPost.com, the World Bank, and Yahoo. For more information please contact 1-617-576-6800 or visit http://www.teragram.com/info.
Source: Teragram Corporation
CONTACT: Doug Russell or Chuck Kabat of Schwartz Communications for Teragram Corporation, +1-781-684-0770, firstname.lastname@example.org
Web site: http://www.teragram.com/info