Yahoo! Opens Content Analysis Technology to all Developers

As the premier digital media company, Yahoo! publishes tons of content every day. In addition to publishing it, we do a lot of work behind the scenes to analyze and understand that content in a scalable and algorithmic way. Today we’re pleased to open up our content analysis technology to the world to help developers build their own fantastic experiences for their sites and users.

The newly launched Yahoo! Content Analysis service replaces Yahoo!’s popular Term Extraction service and now provides advanced content analysis on either text or a URL, leverages Yahoo!’s state of the art machine-learned ranking (MLR) technology to extract key terms from the content, and, more importantly, to rank them based on their overall importance to the content. The output you receive contains the keywords and their ranks along with other actionable metadata.

On top of entity extraction and ranking, developers need to know whether key terms correspond to objects with existing rich metadata. Having this entity/object connection allows for the creation of highly engaging user experiences. The Y! Content Analysis output provides related Wikipedia IDs for key terms when they can be confidently identified. This enables interoperability with linked data on the semantic Web.

Finally, sometimes you’re interested in the whole document in addition to its key terms. We provide classification for your content as well. We’ll map your content or URL to our deep and descriptive taxonomy.

Y! Content Analysis supports US English and Chinese languages for entity detection and ranking. Classification supports only English and more languages will be added gradually.

Our new service replaces the current Term Extraction Service, which is expected to end on March 31, 2012. We will continue to support the Term Extraction requests, but calls must be directed to our YQL table since we’ll be shutting down the non-YQL service. More details can be found on today’s YDN blog post.

The new features and MLR are supported only in the new request format. Give it a try today!

– Peter Levinson, Product Manager, Yahoo! Content Analysis

  • 22 Comments
  • Subscribe
back to yahoo! search

subscription options

Facebook Fans

latest posts

archives