<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Webdogs 2.0 &#187; pika</title>
	<atom:link href="http://www.webdogs.org/tag/pika/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.webdogs.org</link>
	<description>Webdogs 2.0 ~ data, design and derring-do since, uh, whenever</description>
	<lastBuildDate>Mon, 26 Jul 2010 21:04:16 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Pika and the Google Search Appliance make nice</title>
		<link>http://www.webdogs.org/2010/04/02/pika-and-the-google-search-appliance-make-nice/</link>
		<comments>http://www.webdogs.org/2010/04/02/pika-and-the-google-search-appliance-make-nice/#comments</comments>
		<pubDate>Fri, 02 Apr 2010 12:13:42 +0000</pubDate>
		<dc:creator>Brian Lawlor</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[findability]]></category>
		<category><![CDATA[gsa]]></category>
		<category><![CDATA[pika]]></category>

		<guid isPermaLink="false">http://www.webdogs.org/?p=1351</guid>
		<description><![CDATA[For those who have followed The Findability Project, I am pleased to report we have surmounted the basic technical problems of targeting our Pika CMS with the Google Search Appliance.
The back story is one I have purposefully repeated whenever giving a presentation about the project, namely, that our Pika Plan A did not work. We [...]]]></description>
			<content:encoded><![CDATA[<p>For those who have followed <a href="http://www.webdogs.org/the-findability-project-archive/">The Findability Project</a>, I am pleased to report we have surmounted the basic technical problems of targeting our <a href="http://pikasoftware.com/">Pika CMS</a> with the <a href="http://www.google.com/enterprise/search/gsa.html">Google Search Appliance</a>.</p>
<p>The back story is one I have purposefully repeated whenever giving a <a href="http://www.webdogs.org/2010/01/27/findability-slides-and-video-from-2010-tig-conference/">presentation about the project</a>, namely, that our Pika Plan A did not work. We encountered code anomalies in Pika that, among other things, cause it to auto-generate new case intakes and case records when it is crawled by the GSA. As a result, we were unable to use the GSA to crawl the Pika client case content dynamically generated as web pages. Plan A would have been the easiest, no-brainer way to go but we were not able to do so. So Plan B was to have the GSA target the Pika MySQL database directly. Status report: Mission accomplished.</p>
<p>There are GSA capacity issues for us, since our particular GSA&#8217;s one million &#8220;record&#8221; capacity means one million web pages or database records, inclusive, and these database records are not the same thing as the count for client case records. At any given time, we may have some 130,000 to nearly 200,000 client cases in our Pika system (and even more in archival data storage), but from a database perspective, these add up to multi-millions of &#8220;records,&#8221; e.g., various types of time records, case notes, contacts, and so on. Part of the challenge for us was to sort out which pieces of those millions of database records were the ones most needed and useful to our users.  </p>
<p>The solution? Using a well-tailored query, we have the GSA do a selective crawl of the Pika MySQL database to return the most commonly sought and used Pika content: Case numbers, client names, office designations and case notes&#8230; <i>tons</i> of case notes. The basic technical explanation is the GSA performs a database query, returns it as an XML feed, indexes that feed, against which the user&#8217;s search terms are queried and ultimately returned as viewable HTML</p>
<p>What does the the search result look like? A Google search result. The clickable link displays the case number, client name, LSNC office and primary advocate name, e.g., &#8220;90-10-123456 ~ John Client ~ Sacramento ~ Jane Advocate.&#8221; Below that it displays in-context text with the search terms highlighted in bold, essentially like a regular Google search result. Clicking the link dynamically displays the actual Pika case note shown in context. Assuming there are multiple possible matches for a particular Pika case record, there is a link to display all the &#8220;omitted results,&#8221; akin to how regular Google searches work, so the users can see all possible, not just probable matches. Clicking through the GSA search result link also gives the user direct clickable access to the particular client case record since clicking through takes the user to the actual Pika client case record.</p>
<p>That&#8217;s the name of that tune.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.webdogs.org/2010/04/02/pika-and-the-google-search-appliance-make-nice/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Pika 4.0 ~ LSNC customization hoo-hah!</title>
		<link>http://www.webdogs.org/2009/10/28/pika-4-0-lsnc-customization-hoo-hah/</link>
		<comments>http://www.webdogs.org/2009/10/28/pika-4-0-lsnc-customization-hoo-hah/#comments</comments>
		<pubDate>Thu, 29 Oct 2009 06:27:22 +0000</pubDate>
		<dc:creator>Brian Lawlor</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[lsnc]]></category>
		<category><![CDATA[pika]]></category>

		<guid isPermaLink="false">http://www.webdogs.org/?p=611</guid>
		<description><![CDATA[As promised, here is a set of 15 representative screenshots of Legal Services of Northern California&#8217;s most recent customization of Pika 4.0. To view a larger image, right click on it and open the image in a separate window. (Needless to say, none of the data field entries in the screenshots are real.) It shares [...]]]></description>
			<content:encoded><![CDATA[<p>As promised, here is a set of <a href="http://www.webdogs.org/dog_files/photos/pika_4/photos/">15 representative screenshots</a> of <a href="http://www.lsnc.net/">Legal Services of Northern California&#8217;s</a> most recent customization of Pika 4.0. To view a larger image, right click on it and open the image in a separate window. (Needless to say, none of the data field entries in the screenshots are real.) It shares obvious design elements with LSNC&#8217;s <a href="http://www.webdogs.org/video/portal_overview_2009-09-18.swf">shared portal</a>, which serves the &#8220;news and resource&#8221; purpose of the original Pika home page.</p>
<p>Yes, that&#8217;s right. Just (mouse) click our Pika home page three times and you are<a href="http://pikasoftware.com/"> magically transported back home</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.webdogs.org/2009/10/28/pika-4-0-lsnc-customization-hoo-hah/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Summer hiatus</title>
		<link>http://www.webdogs.org/2009/07/15/summer-hiatus/</link>
		<comments>http://www.webdogs.org/2009/07/15/summer-hiatus/#comments</comments>
		<pubDate>Wed, 15 Jul 2009 15:25:17 +0000</pubDate>
		<dc:creator>Brian Lawlor</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[findability]]></category>
		<category><![CDATA[pika]]></category>
		<category><![CDATA[reality]]></category>

		<guid isPermaLink="false">http://www.webdogs.org/?p=586</guid>
		<description><![CDATA[Many months of fish to fry ahead, including bearing down on LSNC&#8217;s customized rebuild of Pika 4.0 and working out a solution for integrating our Google Search Appliance. So, laying low until October 1. See you back here this Fall and in early 2010 at the next Austin TIG, when we serve up the whole [...]]]></description>
			<content:encoded><![CDATA[<p>Many months of fish to fry ahead, including bearing down on LSNC&#8217;s customized rebuild of Pika 4.0 and <a href="http://www.findabilityproject.org/?p=873" class="broken_link" >working out a solution</a> for integrating our Google Search Appliance. So, laying low until October 1. See you back here this Fall and in early 2010 at the next Austin TIG, when we serve up the whole enchilada.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.webdogs.org/2009/07/15/summer-hiatus/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Selecting GSA targets &#8211; Part Three: Quantification, Revision and Finalization</title>
		<link>http://www.webdogs.org/2009/03/22/selecting-gsa-targets-part-three-quantification-revision-and-finalization/</link>
		<comments>http://www.webdogs.org/2009/03/22/selecting-gsa-targets-part-three-quantification-revision-and-finalization/#comments</comments>
		<pubDate>Mon, 23 Mar 2009 04:03:26 +0000</pubDate>
		<dc:creator>Brian Lawlor</dc:creator>
				<category><![CDATA[planning]]></category>
		<category><![CDATA[specs]]></category>
		<category><![CDATA[gsa]]></category>
		<category><![CDATA[pika]]></category>
		<category><![CDATA[reality]]></category>
		<category><![CDATA[tfp]]></category>

		<guid isPermaLink="false">http://www.webdogs.org/findability/?p=729</guid>
		<description><![CDATA[There is a great deal of work proceeding behind the scenes and several key project elements are converging as we move toward finalizing this public project. Among other things, we have been working through modest but practical solutions for better placement and targeting of our existing 300,000+ repository documents, while solidifying all the additional Google [...]]]></description>
			<content:encoded><![CDATA[<p>There is a great deal of work proceeding behind the scenes and several key project elements are converging as we move toward finalizing this public project. Among other things, we have been working through modest but practical solutions for better placement and targeting of our existing 300,000+ repository documents, while solidifying all the additional Google Search Appliance (GSA) targets in our enterprise search sights, described in <a href="http://www.webdogs.org/2008/11/10/selecting-gsa-targets-part-two-the-practical-realities/">Part Two</a>. At the same time, we are in our own March (and April and May) Madness as we mount a rapid-fire round of trainings for each of our eight remote offices (spread out over 50,000 square miles of Northern California) on their new role in making Google Enterprise search work for them, which is to say for all of us. And as mentioned in earlier posts, we are working every bit as earnestly on our latest in-house build of the <a href="http://pikasoftware.com/">Pika CMS</a>.</p>
<p>The infusion of new, future content into the <a href="http://www.webdogs.org/2008/09/30/tfp-taxonomy-part-two-the-practice/">simplified structural taxonomy</a> we created is a separate challenge we will be posting about later. Dealing with our existing files is more immediate, more concrete. Groking those files has been one of the more interesting, <a href="http://www.webdogs.org/2008/09/30/tfp-taxonomy-part-three-the-anecdotes/">at times hilarious</a> parts of this project.</p>
<p>For those legal services programs interested in how the existing files from our eight local offices break out, here are the percentages for the seven most common &#8220;document&#8221; types:</p>
<ul style="margin-left: 50px;">
<li><strong>67%</strong> &#8211; WordPerfect (WPD)</li>
<li><strong>18%</strong> &#8211; Word (DOC)</li>
<li><strong>9%</strong> &#8211; Portable Document Format (PDF)</li>
<li><strong>3%</strong> &#8211; Excel (XLS)</li>
<li><strong>1%</strong> &#8211; Text (TXT)</li>
<li><strong>0.9%</strong> &#8211; Rich Text Format (RTF) </li>
<li><strong>0.6%</strong> &#8211; PowerPoint (PPT)</li>
</ul>
<p>(Discretion prohibits us from detailing the other file flotsam discovered on local office servers. That said, allow us to observe that some within our organization have extraordinarily good taste in photos taken by National Geographic, and not such good taste in music.)</p>
<p>We are totally on track for targeting most of our planned GSA targets: The existing office archive files listed above have long been targeted (although we still have a lot of work left to fit them into our structural taxonomy); over the last several months we have worked very hard to refresh and update (and remove, as warranted) the targeted content at LSNC&#8217;s various public websites; and we are very pleased with the quality of the <a href="http://www.webdogs.org/2009/02/26/comparing-google-sites-and-gsa-search-results-with-release-52-in-place/">GSA results we are getting out of Google Sites</a>.</p>
<p>This is all good news. In addition, we are putting in place a few more content channels: Targeting the content in our organization&#8217;s seven private Google discussion groups, and a program-wide canvass for select hard-copy training resource materials for <a href="http://www.webdogs.org/2008/11/12/converting-hard-copy-documents-for-addition-to-the-shared-repository/">digital conversion and addition to the shared repository</a>. It&#8217;s all good.</p>
<p>We have had one major disappointment: We discovered that there are significant, unanticipated technical challenges unique to the Pika CMS that thus far have prevented effective use of the GSA to target Pika content. The problem is not the GSA itself or configuring the GSA to target Pika. The GSA by design performs wholly benign, non-destructive crawls as it indexes targeted records. We did a huge amount of target testing and <a href="http://en.wikipedia.org/wiki/Search_engine_results_page">SERP</a> evaluation, and we were very pleased &#8212; actually, thrilled is a better word &#8212; with the results we were getting from Pika. The unanticipated problem is that the current version of Pika is not well optimized for use as an enterprise search target. There are code anomalies in Pika that, among other things, cause it to auto-generate new case intakes and case records when it is crawled by the GSA.</p>
<p>After an assessment by Pika Software, it is now apparent it will take something in the neighborhood of 200 hours of work to make Pika more receptive, shall we say, to an external crawl by the GSA. (Ouch!) So for now, we have to put that part of the project to the side. File under: Lessons Learned.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.webdogs.org/2009/03/22/selecting-gsa-targets-part-three-quantification-revision-and-finalization/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>
