Integrating Google Enterprise Search with Social Bookmarking
June 20th, 2007by Jeremy Thomas

I don’t generally write technical posts, but the geek inside me wanted to share some insights into how to “mashup” Google Enterprise Search and Social Bookmarking. Here’s what you’ll need:
- A Google Mini or Google Search Appliance
- Scuttle - an open source, delicious-like social bookmarking application
- Xalan - a Java-based, open source XSLT engine
The Value Proposition
The idea behind this is this: when a user performs a search, each of the results on the page display social bookmarking information (tags and “saved by” count) if that URL has been bookmarked by somebody within the enterprise. This helps the searcher make an more informed choice when deciding which content to look at. If somebody else inside the enterprise has found a relevant content item to be helpful then perhaps the searcher should look at it first. It also exposes the searcher to others within the enterprise who have a mutual interest in the content being looked for.
Technical Mumbo Jumbo
Out of the box the Google Search Appliance provides a great amount of flexibility in modifying how search page behaves and feels. This is done through modifying the default XSLT (which produces a UI nearly identical to google.com). So, ideally all I’d have to do to include scuttle into the search results is use the document() XSLT function invoke a scuttle API that returns XML data about the URL (including the tag list and saved by count). I could then aggregate this XML with Google results XML displaying a seamless result to the user. Unfortunately the Google Search Appliance XSLT engine does not support the document() function, so I’m left to run an external XSLT engine - hence Xalan.
So, here’s what I did:
- Downloaded Xalan and made some minor tweaks to the “SimpleServlet” demo application (class and package name change, modified the configuration to use my XSLT and invoke the Google Search Appliance to retrieve the results in XML format).
- Packaged the modified version of the “SimpleServlet” and deployed it on my Tomcat instance.
- Copied the Default XSLT style sheet from the Google Search Appliance, and modified it starting at line 2398 (the XSLT template for a snippet) adding code to invoke the Scuttle API and incorporate tags and saved by count at the bottom of the snippet. Saved the XSLT and bundled it with my “SimpleServlet” war file.
- Copied the scuttle “posts_get.php” API and made a new API that returns bookmark information for a given URL.
That’s it. Users now access my custom web application to perform enterprise searches (instead if using the default GSA frontend), but so far so good. Hopefully one day Google will support the document() function so we can do integration like this using the XSLT engine on the appliance. That would certainly make life a lot easier.
Social Search
April 5th, 2007by Jeremy Thomas
I recently wrote about how Search should be the core of Enterprise 2.0, and I’m encouraged to see the release of several new socially-oriented Enterprise Search tools. The first is a release of ConnectBeam at Honeywell (covered by Andrew McAffee here). The second is an announcement from BEA on its new Web 2.0 product called Pathways, which is covered over at the FASTForwardBlog.
Both of these merge social bookmarking metadata (tags, users) with organic Enterprise Search results. This is a fantastic idea for two reasons:
- Users can make a better judgement on the value of the document based on social bookmarking metadata (i.e. how many people have bookmarked it).
- Users are exposed to others who have similar interests to what they’re searching for (i.e. if a document I find useful has been bookmarked by 5 other people, maybe those people have found other documents I might also find useful), and this facilitates collaboration.
Internally we’ve developed a prototype that does something similar. It fuses Google Enterprise Search results with a custom social bookmarking system displaying the folksonomy that has developed for a given document and a link to other users who’ve bookmarked it:

We’ve seen interest in this extension to Enterprise Search from several customers. I’m not at liberty to say who at this point, but in a few month’s time we may see the successful delivery of this system and I may be able to comment on it further.
Search is the Core
March 13th, 2007by Jeremy Thomas
I’m being a bit redundant with this post, but I cannot over-emphasize how important Enterprise Search is to Enterprise 2.0. As Dion Hinchliffe, Chief Technology Officer for the Web 2.0 Advisory and consulting firm Hinchliffe & Company, puts it,
“Discoverability isn’t an after thought , it’s the core”
Enterprise 2.0 systems must be centered around Search - the ability to discover content and people. It’s the first component in McAfee’s SLATES methodology, and “Google-like” search it is a fundamental feature that most enterprises lack. Recent experiences have shown me that companies are eager to implement this thing called Web 2.0, but on their roadmap their first priority is making their existing content discoverable.
In my humble opinion search-centric Enterprise 2.0 systems, like Suite Two, will drive the greatest amount of user acceptance. 80% of internet experiences begin with search and there’s no reason to assume this should be any different inside the firewall, even with an Enterprise 2.0 ecosystem in place.
Follow Me