<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Webdogs 2.0 &#187; metadata</title>
	<atom:link href="http://www.webdogs.org/tag/metadata/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.webdogs.org</link>
	<description>Webdogs 2.0 ~ data, design and derring-do since, uh, whenever</description>
	<lastBuildDate>Mon, 26 Jul 2010 21:04:16 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Schmetadata 2: Judgment Day</title>
		<link>http://www.webdogs.org/2009/03/29/schmetadata-2-judgment-day/</link>
		<comments>http://www.webdogs.org/2009/03/29/schmetadata-2-judgment-day/#comments</comments>
		<pubDate>Sun, 29 Mar 2009 18:37:36 +0000</pubDate>
		<dc:creator>Brian Lawlor</dc:creator>
				<category><![CDATA[concepts]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[reality]]></category>
		<category><![CDATA[tfp]]></category>

		<guid isPermaLink="false">http://www.webdogs.org/findability/?p=796</guid>
		<description><![CDATA[Think of it as the Revenge of the Empiricists.
A few days ago my earlier post about metadata, with the conclusion that &#8220;we don&#8217;t need metadata,&#8221; caught the attention of Daniel Tunkelang. It just so happens that Daniel is the chief scientist at Endeca, a prominent enterprise information company. You know, like, he knows what he&#8217;s [...]]]></description>
			<content:encoded><![CDATA[<p>Think of it as the Revenge of the Empiricists.</p>
<p>A few days ago my <a href="http://www.webdogs.org/2009/03/26/metadata-schmetadata-relevance-and-reality/">earlier post about metadata</a>, with the conclusion that &#8220;we don&#8217;t need metadata,&#8221; caught the attention of <a href="http://thenoisychannel.com/about/">Daniel Tunkelang</a>. It just so happens that Daniel is the chief scientist at <a href="http://www.endeca.com/">Endeca</a>, a prominent enterprise information company. You know, like, he knows what he&#8217;s talking about.</p>
<p>Long story short, Daniel <a href="http://www.webdogs.org/2009/03/26/metadata-schmetadata-relevance-and-reality/#comments">commented</a> on my post here and then, after I responded, mirrored our discussion in <a href="http://thenoisychannel.com/2009/03/27/does-metadata-matter/">Does Metadata Matter</a>, a new thread at <a href="http://thenoisychannel.com/">The Noisy Channel</a>, his blog that hosts vigorous discussions of search design and other enterprise information issues <em>way</em> above my pay grade. Daniel and others there critically but constructively took me to task because of the empirically unsupported general conclusion that metadata is not needed, and the specific failure to make clear the anecdotal basis for our conclusion about metadata on this project. As you can see from the discussion at Daniel&#8217;s blog, they killed me on the empirical point. In response, <a href="http://thenoisychannel.com/2009/03/27/does-metadata-matter/#comment-2646">I better explained there</a> than I did in my original post why our conclusion about not needing metadata on this project makes sense.</p>
<p>The discussion there ended on Christopher&#8217;s <a href="http://thenoisychannel.com/2009/03/27/does-metadata-matter/#comment-2652">grace note</a>, something that the <a href="http://tig.lsc.gov/">TIG program</a> and others within the legal services community will appreciate:</p>
<p class="quote">
I also want to applaud your project which I should have done before, the mission your team has set is very admirable. It’s gratifying to see technology being put to use to help people who may not otherwise be able to have access to the same advances we do.
</p>
<p>If you&#8217;re interested in enterprise search, you should consider paying <a href="http://thenoisychannel.com/">The Noisy Channel</a> a visit.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.webdogs.org/2009/03/29/schmetadata-2-judgment-day/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Metadata Schmetadata, Relevance and Reality</title>
		<link>http://www.webdogs.org/2009/03/26/metadata-schmetadata-relevance-and-reality/</link>
		<comments>http://www.webdogs.org/2009/03/26/metadata-schmetadata-relevance-and-reality/#comments</comments>
		<pubDate>Fri, 27 Mar 2009 04:45:18 +0000</pubDate>
		<dc:creator>Brian Lawlor</dc:creator>
				<category><![CDATA[concepts]]></category>
		<category><![CDATA[findability]]></category>
		<category><![CDATA[gsa]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[reality]]></category>
		<category><![CDATA[taxonomy]]></category>
		<category><![CDATA[tfp]]></category>

		<guid isPermaLink="false">http://www.webdogs.org/findability/?p=755</guid>
		<description><![CDATA[These are our contentions:

Metadata matters.
Metadata adds worth to data.
Documents are data.
Keywords are the essential data in documents.
Keywords in context create knowledge.
Documents have worth because they contain knowledge.
Enterprise search finds keywords.
Findable keywords yield documents.
Findable keywords in context yield documents with knowledge.
Knowledge in documents has worth.
Metadata is not essential for enterprise search.
We don&#8217;t need metadata.

What&#8217;s our point? [...]]]></description>
			<content:encoded><![CDATA[<p>These are our contentions:</p>
<ul class="content">
<li>Metadata matters.</li>
<li>Metadata adds worth to data.</li>
<li>Documents are data.</li>
<li>Keywords are the essential data in documents.</li>
<li>Keywords in context create knowledge.</li>
<li>Documents have worth because they contain knowledge.</li>
<li>Enterprise search finds keywords.</li>
<li>Findable keywords yield documents.</li>
<li>Findable keywords in context yield documents with knowledge.</li>
<li>Knowledge in documents has worth.</li>
<li>Metadata is not essential for enterprise search.</li>
<li>We don&#8217;t need metadata.</li>
</ul>
<p>What&#8217;s our point? Before answering that question, we invite understanding of the context: This project is about implementation of enterprise search within a large but not humongous non-profit organization. We&#8217;re talking about 170 paid employees, with easily an equal number of volunteers of one kind or another. So let&#8217;s say for purposes of context that we have 350+ real people using our networked infrastructure. We have two &#8212; count &#8216;em, <em>two</em> &#8212; IT guys. We&#8217;re not talking Fortune 500 here. We&#8217;re not even talking Fortune 500,000. That&#8217;s our world.</p>
<p>Working on this project, we have evaluated what we need from metadata as part of enterprise search implementation. Our conclusion? We don&#8217;t need metadata.</p>
<p>Or better said, we don&#8217;t need to <em>add</em> metadata for a Google Search Appliance (GSA) to accomplish what we want to accomplish with enterprise search. We <em>could</em> use metadata more &#8212; and there are several very impressive features in a GSA that can exploit <a href="http://code.google.com/apis/searchappliance/documentation/52/metadata.html">external metadata</a> and <a href="http://www.google.com/enterprise/gsa/features.html#security">metadata biasing</a> of search results &#8212; assuming the organization has the resources to <a href="http://solutions.dowjones.com/cookbook/ebook_sla2008/cookbookebook.pdf">organize and manage metadata</a>. But as a practical matter, do we have the resources to go down that path and, ultimately, do we need it? No.</p>
<p>In fact, as part of this project, we have put a metadata model in place, a simple &#8220;labeling&#8221; or tagging system. It exploits our Sharepoint server installation with a practical (if kludgy) way to add metadata to files saved to a shared document repository. For example, when saving a file in a directory in our <a href="http://www.webdogs.org/2008/09/30/tfp-taxonomy-part-two-the-practice/">structural taxonomy</a>, as the user navigates &#8212; say, to the Income Maintenance folder&#8230;</p>
<p><img src="http://www.webdogs.org/project_files/ui_folder.png" alt="" /></p>
<p>&#8230;a dialog box pops up with a prompt to add one or more optional &#8220;LSNC labels&#8221; to the file, associating the file with additional folders or categories in our taxonomy:</p>
<p><img src="http://www.webdogs.org/project_files/dialog_overlay.png" alt="" /></p>
<p>In the above example, an Excel spreadsheet with unemployment data is being saved to the &#8220;Unemployment Insurance&#8221; folder, a subfolder under the &#8220;Income Maintenance&#8221; top-level directory, but is also marked or tagged as &#8220;Data-Statistics-GIS&#8221; and &#8220;Employment.&#8221; Even then, this kludge only works with Microsoft applications, which is to say Sharepoint doesn&#8217;t work as cooperatively with other applications we rely on, like WordPerfect, Adobe Acrobat and others.</p>
<p>Regardless, is the addition of metadata to documents a good thing? Obviously, yes. <a href="http://www.library.cornell.edu/dmlib/robson.pdf">Metadata matters</a>. (Taxonomy matters, too&#8230; yet <a href="http://gilbane.com/search_blog/2008/09/taxonomy_yes_but_for_what.html">to what purpose?</a>) Do you need to add metadata to documents for effective enterprise search, and specifically with a Google Search Appliance? Not really, not for what we are doing. Why not? Because improvements in search algorithms are such that <a href="http://blogs.heremenow.info/peter_bailey/2005/05/does-search-need-metadata-schemas.html">metadata is not needed to help the search</a>.</p>
<p>The poster child for these gains in enterprise search algorithms is, not surprisingly, Google whose <a href="http://www.enterprisesearchblog.com/2007/10/google-applianc.html">GSA has matured considerably</a>. Google is a verb. Microsoft (or Sharepoint) are not. A principal reason for that is Google years ago broke out early from the search-engine pack and raised the bar in terms of quality of search results. Google became what the average person now expects from search. That is why it is a verb. It is what most people do. They Google. Another reason is that Google <a href="http://www.enterprisesearchblog.com/2008/09/the-future-is-s.html">simplifies search</a>.</p>
<p>In the context of our project, at the scale and with the resources available to even a fairly large non-profit, what is practical or impractical in using metadata? And even if used, does it affect the quality of enterprise search results <em>enough</em> to warrant those additional costs in time and money?</p>
<p>So far, we don&#8217;t see it.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.webdogs.org/2009/03/26/metadata-schmetadata-relevance-and-reality/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Setting the stage for a basic Google Search Appliance deployment</title>
		<link>http://www.webdogs.org/2009/02/08/guide-to-installing-and-configuring-a-basic-gsa-enterprise-search-platform/</link>
		<comments>http://www.webdogs.org/2009/02/08/guide-to-installing-and-configuring-a-basic-gsa-enterprise-search-platform/#comments</comments>
		<pubDate>Sun, 08 Feb 2009 19:24:36 +0000</pubDate>
		<dc:creator>Brian Lawlor</dc:creator>
				<category><![CDATA[gsa]]></category>
		<category><![CDATA[sharepoint]]></category>
		<category><![CDATA[firefox]]></category>
		<category><![CDATA[ie]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[moss]]></category>
		<category><![CDATA[tfp]]></category>

		<guid isPermaLink="false">http://www.webdogs.org/findability/?p=653</guid>
		<description><![CDATA[We have posted the first 16 sections of Step-by-Step: Installation and Configuration of a Google Enterprise Search Platform. These initial sections offer a basic, practical guide to how The Findability Project, with its project specs as ground zero, proceeded with installation of the Windows server software and settings, installation and configuration of Microsoft Office SharePoint [...]]]></description>
			<content:encoded><![CDATA[<p>We have posted the first 16 sections of <a href="http://www.webdogs.org/tfp-step-by-step-installation-setup-and-configuration/">Step-by-Step: Installation and Configuration of a Google Enterprise Search Platform</a>. These initial sections offer a basic, practical guide to how <strong>The Findability Project</strong>, with its <a href="http://www.webdogs.org/2008/08/21/86/">project specs</a> as ground zero, proceeded with installation of the Windows server software and settings, installation and configuration of Microsoft Office SharePoint Server (MOSS), and basic configuration of the SharePoint document site.</p>
<p>Much more is coming. These first several steps simply set the stage for getting into deployment of the Google Search Appliance (GSA), what we think are the more interesting and challenging parts. Among other things, upcoming are documentation of how to install and configure the connector to the GSA, and walkthroughs explaining how we have configured our GSA to accomplish the <a href="http://www.webdogs.org/2007/10/08/explicating-the-lsnc-findability-project/">goals of the project</a>.</p>
<p>(Kudos to Ed Lachgar, IT <em>extraordinaire</em> at <a href="http://www.lsnc.net/">Legal Services of Northern California</a> for writing up these first 16 sections.)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.webdogs.org/2009/02/08/guide-to-installing-and-configuring-a-basic-gsa-enterprise-search-platform/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Tagging: Metadata for the Masses</title>
		<link>http://www.webdogs.org/2007/01/10/tagging-metadata-for-the-masses/</link>
		<comments>http://www.webdogs.org/2007/01/10/tagging-metadata-for-the-masses/#comments</comments>
		<pubDate>Wed, 10 Jan 2007 23:11:13 +0000</pubDate>
		<dc:creator>Brian Lawlor</dc:creator>
				<category><![CDATA[metadata]]></category>
		<category><![CDATA[tagging]]></category>
		<category><![CDATA[usability]]></category>

		<guid isPermaLink="false">http://www.webdogs.org/2007/01/10/tagging-metadata-for-the-masses/</guid>
		<description><![CDATA[This is a small doggie-byte of information I ran across recently that may be of interest to some: Tagging vs. Cataloging: What It&#8217;s All About. Chiara Fox of Adaptive Path does an especially good job of explaining the concept and the usability metaphor known as &#8220;tagging.&#8221; Yeah, we all now know about tags and how [...]]]></description>
			<content:encoded><![CDATA[<p>This is a small doggie-byte of information I ran across recently that may be of interest to some: <a href="http://www.adaptivepath.com/publications/essays/archives/000695.php">Tagging vs. Cataloging: What It&#8217;s All About</a>. Chiara Fox of <a href="http://www.adaptivepath.com/">Adaptive Path</a> does an especially good job of explaining the concept and the usability metaphor known as &#8220;<a href="http://en.wikipedia.org/wiki/Tag_%28metadata%29">tagging</a>.&#8221; Yeah, we all now know about tags and how to use them. But her article reveals explicitly what we may know intuitively  from using tags, but to which we can&#8217;t quite give a name. Really good stuff.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.webdogs.org/2007/01/10/tagging-metadata-for-the-masses/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
