<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Entity Extraction &amp; Content API Evaluation</title>
	<atom:link href="http://blog.viewchange.org/2010/05/entity-extraction-content-api-evaluation/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.viewchange.org/2010/05/entity-extraction-content-api-evaluation/</link>
	<description>Stories powering progress</description>
	<lastBuildDate>Tue, 23 Nov 2010 17:37:50 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: John Lehmann</title>
		<link>http://blog.viewchange.org/2010/05/entity-extraction-content-api-evaluation/comment-page-1/#comment-3626</link>
		<dc:creator>John Lehmann</dc:creator>
		<pubDate>Tue, 02 Nov 2010 22:52:51 +0000</pubDate>
		<guid isPermaLink="false">http://blog.viewchange.org/?p=31#comment-3626</guid>
		<description>While everyone&#039;s throwing their hat in the ring, I thought Extractiv might as well. :)

Extractiv&#039;s novelty is that we run our NLP on a web crawler, so you can transform the open unstructured web into structured semantic output.  This is great when you want to discover information from the web as opposed to process collections of documents you already have.  For the latter case, we provide a REST-based on-demand service. 

We currently provide over 150 entity types, relations, and more, in output formats including JSON and RDF.  I&#039;m with Tom Tague when he said &quot;Entities describe the news – facts and events are the news.&quot;  We like entities, but want to provide a lot more than that.  We&#039;re new, so bear with us as we still have some additional services to roll out, including entity linking, which should be within a few weeks.

To usher us in as a late contender to this post, I&#039;ve put URLs by which you can get live results on these 3 transcripts (or any URL), which hits our demo server.  I promise I didn&#039;t doctor the results. :)

http://rest.extractiv.com/extractiv/?url=http://rest.extractiv.com/sample_docs/viewchange1.txt&amp;output_format=html_viewer#
http://rest.extractiv.com/extractiv/?url=http://rest.extractiv.com/sample_docs/viewchange2.txt&amp;output_format=html_viewer#
http://rest.extractiv.com/extractiv/?url=http://rest.extractiv.com/sample_docs/viewchange3.txt&amp;output_format=html_viewer#

website: http://www.extractiv.com
blog: http://blog.extractiv.com
docs: http://wiki.extractiv.com
twitter: http://twitter.com/#!/Extractiv (@extractiv)</description>
		<content:encoded><![CDATA[<p>While everyone&#8217;s throwing their hat in the ring, I thought Extractiv might as well. :)</p>
<p>Extractiv&#8217;s novelty is that we run our NLP on a web crawler, so you can transform the open unstructured web into structured semantic output.  This is great when you want to discover information from the web as opposed to process collections of documents you already have.  For the latter case, we provide a REST-based on-demand service. </p>
<p>We currently provide over 150 entity types, relations, and more, in output formats including JSON and RDF.  I&#8217;m with Tom Tague when he said &#8220;Entities describe the news – facts and events are the news.&#8221;  We like entities, but want to provide a lot more than that.  We&#8217;re new, so bear with us as we still have some additional services to roll out, including entity linking, which should be within a few weeks.</p>
<p>To usher us in as a late contender to this post, I&#8217;ve put URLs by which you can get live results on these 3 transcripts (or any URL), which hits our demo server.  I promise I didn&#8217;t doctor the results. :)</p>
<p><a href="http://rest.extractiv.com/extractiv/?url=http://rest.extractiv.com/sample_docs/viewchange1.txt&amp;output_format=html_viewer#" rel="nofollow">http://rest.extractiv.com/extractiv/?url=http://rest.extractiv.com/sample_docs/viewchange1.txt&amp;output_format=html_viewer#</a><br />
<a href="http://rest.extractiv.com/extractiv/?url=http://rest.extractiv.com/sample_docs/viewchange2.txt&amp;output_format=html_viewer#" rel="nofollow">http://rest.extractiv.com/extractiv/?url=http://rest.extractiv.com/sample_docs/viewchange2.txt&amp;output_format=html_viewer#</a><br />
<a href="http://rest.extractiv.com/extractiv/?url=http://rest.extractiv.com/sample_docs/viewchange3.txt&amp;output_format=html_viewer#" rel="nofollow">http://rest.extractiv.com/extractiv/?url=http://rest.extractiv.com/sample_docs/viewchange3.txt&amp;output_format=html_viewer#</a></p>
<p>website: <a href="http://www.extractiv.com" rel="nofollow">http://www.extractiv.com</a><br />
blog: <a href="http://blog.extractiv.com" rel="nofollow">http://blog.extractiv.com</a><br />
docs: <a href="http://wiki.extractiv.com" rel="nofollow">http://wiki.extractiv.com</a><br />
twitter: <a href="http://twitter.com/#" rel="nofollow">http://twitter.com/#</a>!/Extractiv (@extractiv)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Christoph Diefenthal</title>
		<link>http://blog.viewchange.org/2010/05/entity-extraction-content-api-evaluation/comment-page-1/#comment-2650</link>
		<dc:creator>Christoph Diefenthal</dc:creator>
		<pubDate>Mon, 20 Sep 2010 17:01:54 +0000</pubDate>
		<guid isPermaLink="false">http://blog.viewchange.org/?p=31#comment-2650</guid>
		<description>This its really a greate article about semantic services. We are also comparing entity extraction services right know - maybe you like our little Flash/Flex tool where you see all results at once:
http://www.veeeb.de/blog/news/entity-extraction-alchemyapi-evri-opencalais/
Have fun!
Christoph Diefenthal</description>
		<content:encoded><![CDATA[<p>This its really a greate article about semantic services. We are also comparing entity extraction services right know &#8211; maybe you like our little Flash/Flex tool where you see all results at once:<br />
<a href="http://www.veeeb.de/blog/news/entity-extraction-alchemyapi-evri-opencalais/" rel="nofollow">http://www.veeeb.de/blog/news/entity-extraction-alchemyapi-evri-opencalais/</a><br />
Have fun!<br />
Christoph Diefenthal</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Thomas Steiner</title>
		<link>http://blog.viewchange.org/2010/05/entity-extraction-content-api-evaluation/comment-page-1/#comment-1673</link>
		<dc:creator>Thomas Steiner</dc:creator>
		<pubDate>Thu, 05 Aug 2010 09:45:45 +0000</pubDate>
		<guid isPermaLink="false">http://blog.viewchange.org/?p=31#comment-1673</guid>
		<description>Thanks for this very insightful analysis. I repeated your tests on many different data sets, and can fully confirm your results. My focus was mainly on the Linked Data aspect, Zemanta and AlchemyAPI both do a really good job there.

@tomayac</description>
		<content:encoded><![CDATA[<p>Thanks for this very insightful analysis. I repeated your tests on many different data sets, and can fully confirm your results. My focus was mainly on the Linked Data aspect, Zemanta and AlchemyAPI both do a really good job there.</p>
<p>@tomayac</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Elliot Turner</title>
		<link>http://blog.viewchange.org/2010/05/entity-extraction-content-api-evaluation/comment-page-1/#comment-355</link>
		<dc:creator>Elliot Turner</dc:creator>
		<pubDate>Tue, 06 Jul 2010 22:43:32 +0000</pubDate>
		<guid isPermaLink="false">http://blog.viewchange.org/?p=31#comment-355</guid>
		<description>Hello and thanks for including AlchemyAPI in your evaluation!

Curious which NLP APIs from Alchemy were used in this comparison: AlchemyAPI provides both a Zementa-style &quot;Concept Tagging&quot; API and a traditional &quot;Entity Extraction&quot; API.  

If you&#039;re receiving a low number of entity results, you&#039;re likely using only one of our APIs.  For the maximum number of semantically linked results (for Keywords/Tags AND named Entities) be sure to use both Named Entity Extraction ( http://www.alchemyapi.com/api/entity/ ) and Concept Tagging ( http://www.alchemyapi.com/api/concept/ ).

AlchemyAPI&#039;s Concept Tagging API returns disambiguated and linked-data-enhanced results for a wide variety of concepts and named entity types.  When comparing against solutions such as Zementa you&#039;ll find this API provides a more &quot;apples to apples&quot; comparison.</description>
		<content:encoded><![CDATA[<p>Hello and thanks for including AlchemyAPI in your evaluation!</p>
<p>Curious which NLP APIs from Alchemy were used in this comparison: AlchemyAPI provides both a Zementa-style &#8220;Concept Tagging&#8221; API and a traditional &#8220;Entity Extraction&#8221; API.  </p>
<p>If you&#8217;re receiving a low number of entity results, you&#8217;re likely using only one of our APIs.  For the maximum number of semantically linked results (for Keywords/Tags AND named Entities) be sure to use both Named Entity Extraction ( <a href="http://www.alchemyapi.com/api/entity/" rel="nofollow">http://www.alchemyapi.com/api/entity/</a> ) and Concept Tagging ( <a href="http://www.alchemyapi.com/api/concept/" rel="nofollow">http://www.alchemyapi.com/api/concept/</a> ).</p>
<p>AlchemyAPI&#8217;s Concept Tagging API returns disambiguated and linked-data-enhanced results for a wide variety of concepts and named entity types.  When comparing against solutions such as Zementa you&#8217;ll find this API provides a more &#8220;apples to apples&#8221; comparison.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rob DiCiuccio</title>
		<link>http://blog.viewchange.org/2010/05/entity-extraction-content-api-evaluation/comment-page-1/#comment-113</link>
		<dc:creator>Rob DiCiuccio</dc:creator>
		<pubDate>Fri, 04 Jun 2010 21:20:50 +0000</pubDate>
		<guid isPermaLink="false">http://blog.viewchange.org/?p=31#comment-113</guid>
		<description>I&#039;ve posted the test tool source code on GitHub:

http://github.com/robdiciuccio/Simple-API-Test-Tool</description>
		<content:encoded><![CDATA[<p>I&#8217;ve posted the test tool source code on GitHub:</p>
<p><a href="http://github.com/robdiciuccio/Simple-API-Test-Tool" rel="nofollow">http://github.com/robdiciuccio/Simple-API-Test-Tool</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mitul</title>
		<link>http://blog.viewchange.org/2010/05/entity-extraction-content-api-evaluation/comment-page-1/#comment-112</link>
		<dc:creator>Mitul</dc:creator>
		<pubDate>Fri, 04 Jun 2010 18:17:35 +0000</pubDate>
		<guid isPermaLink="false">http://blog.viewchange.org/?p=31#comment-112</guid>
		<description>Very nice article comparing different entity extractor and content apis.

Just to mention, Kosmix.com builds topic pages for any entity and might be worth mentioning as a content provider. For example, http://www.kosmix.com/topic/akon or http://www.kosmix.com/topic/Splice_(film)

Also, Kosmix provides an api to recognize, disambiguate, and link entities in a document. For example, http://www.healthboards.com/boards/showthread.php?t=748185

Just to mention, I work at Kosmix, and appreciate your article comparing different products.</description>
		<content:encoded><![CDATA[<p>Very nice article comparing different entity extractor and content apis.</p>
<p>Just to mention, Kosmix.com builds topic pages for any entity and might be worth mentioning as a content provider. For example, <a href="http://www.kosmix.com/topic/akon" rel="nofollow">http://www.kosmix.com/topic/akon</a> or <a href="http://www.kosmix.com/topic/Splice_(film)" rel="nofollow">http://www.kosmix.com/topic/Splice_(film)</a></p>
<p>Also, Kosmix provides an api to recognize, disambiguate, and link entities in a document. For example, <a href="http://www.healthboards.com/boards/showthread.php?t=748185" rel="nofollow">http://www.healthboards.com/boards/showthread.php?t=748185</a></p>
<p>Just to mention, I work at Kosmix, and appreciate your article comparing different products.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Emiliano Pasqualetti</title>
		<link>http://blog.viewchange.org/2010/05/entity-extraction-content-api-evaluation/comment-page-1/#comment-107</link>
		<dc:creator>Emiliano Pasqualetti</dc:creator>
		<pubDate>Thu, 03 Jun 2010 16:53:57 +0000</pubDate>
		<guid isPermaLink="false">http://blog.viewchange.org/?p=31#comment-107</guid>
		<description>Well done Andraz!
Congratulations to you and your team.</description>
		<content:encoded><![CDATA[<p>Well done Andraz!<br />
Congratulations to you and your team.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Andraz Tori</title>
		<link>http://blog.viewchange.org/2010/05/entity-extraction-content-api-evaluation/comment-page-1/#comment-14</link>
		<dc:creator>Andraz Tori</dc:creator>
		<pubDate>Wed, 19 May 2010 18:02:22 +0000</pubDate>
		<guid isPermaLink="false">http://blog.viewchange.org/?p=31#comment-14</guid>
		<description>@Frederick:
You need to use markup_limit parameter if you want to get more entities back from Zemanta. Zemanta only returns &#039;known&#039; entities that can be tied to known web addresses, not unknown ones.

@Bob McCreary:
Zemanta isn&#039;t designed to run on such short sentences (albeit it works most of the time). Try it with larger article. Also you seem to be looking at &quot;keywords&quot; instead of &quot;markup&quot; part of the response.

bye
Andraz Tori, CTO at Zemanta</description>
		<content:encoded><![CDATA[<p>@Frederick:<br />
You need to use markup_limit parameter if you want to get more entities back from Zemanta. Zemanta only returns &#8216;known&#8217; entities that can be tied to known web addresses, not unknown ones.</p>
<p>@Bob McCreary:<br />
Zemanta isn&#8217;t designed to run on such short sentences (albeit it works most of the time). Try it with larger article. Also you seem to be looking at &#8220;keywords&#8221; instead of &#8220;markup&#8221; part of the response.</p>
<p>bye<br />
Andraz Tori, CTO at Zemanta</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Bob McCreary</title>
		<link>http://blog.viewchange.org/2010/05/entity-extraction-content-api-evaluation/comment-page-1/#comment-12</link>
		<dc:creator>Bob McCreary</dc:creator>
		<pubDate>Wed, 19 May 2010 17:39:23 +0000</pubDate>
		<guid isPermaLink="false">http://blog.viewchange.org/?p=31#comment-12</guid>
		<description>&quot;John Smith walked down the street in Paris, in Lamar County.&quot;

Zemanta doesn&#039;t get &quot;John Smith&quot;, or &quot;Paris&quot;.  

Instead it comes back with &quot;Georgia&quot; (strange), &quot;United States&quot;, and &quot;Lamar County&quot;.

Many of the other listed APIs are detecting the entities missed by Zemanta, so I&#039;m at a loss as to why it&#039;s listed first.  Seeing similar results for pretty much every news article I try.</description>
		<content:encoded><![CDATA[<p>&#8220;John Smith walked down the street in Paris, in Lamar County.&#8221;</p>
<p>Zemanta doesn&#8217;t get &#8220;John Smith&#8221;, or &#8220;Paris&#8221;.  </p>
<p>Instead it comes back with &#8220;Georgia&#8221; (strange), &#8220;United States&#8221;, and &#8220;Lamar County&#8221;.</p>
<p>Many of the other listed APIs are detecting the entities missed by Zemanta, so I&#8217;m at a loss as to why it&#8217;s listed first.  Seeing similar results for pretty much every news article I try.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dave Black</title>
		<link>http://blog.viewchange.org/2010/05/entity-extraction-content-api-evaluation/comment-page-1/#comment-11</link>
		<dc:creator>Dave Black</dc:creator>
		<pubDate>Wed, 19 May 2010 17:06:21 +0000</pubDate>
		<guid isPermaLink="false">http://blog.viewchange.org/?p=31#comment-11</guid>
		<description>Seems to be quite a bit of variance in these Entity Extraction evaluations.

Another recent look into this space by Michael Fagan of the Bing Maps team listed OpenCalais near the bottom of the rankings, yet it&#039;s listed as #2 here.  

Here&#039;s a link to fagan&#039;s review:

http://faganm.com/blog/2010/01/02/1009/</description>
		<content:encoded><![CDATA[<p>Seems to be quite a bit of variance in these Entity Extraction evaluations.</p>
<p>Another recent look into this space by Michael Fagan of the Bing Maps team listed OpenCalais near the bottom of the rankings, yet it&#8217;s listed as #2 here.  </p>
<p>Here&#8217;s a link to fagan&#8217;s review:</p>
<p><a href="http://faganm.com/blog/2010/01/02/1009/" rel="nofollow">http://faganm.com/blog/2010/01/02/1009/</a></p>
]]></content:encoded>
	</item>
</channel>
</rss>
