Archived entries for Info Mgmt

Google Desktop Search

From Gary Price, at the wonderful ResourceShelf:

Google Launches a Desktop Search Application
About thirty minutes ago, Google launched a desktop search application. I’ve been using Google Desktop Search for the past 48hrs and I’m VERY impressed. Danny Sullivan has just posted (I contributed to the story) a review on Search Engine Watch. Random comments about GDS:

  • I love the fact that every web page viewed in your browser is automatically cached on your computer and immediately made full text searchable. Seruku, a product I wrote about in June, offers a similar type of service for a fee. Additionally, every time you make a change to a Word document or other local file a new cached copy is made. Now, you can easily review revisions to your work by taking a look at the various cached copies of the material.
    + One negative is that it DOESN’t index the full text of pdf files. Google says it’s coming soon.
  • The Copernic Desktop Search tool remains a very useful product and offers several features not found with the GDS.
  • Privacy issues? Of course. Understand what you’re making searchable and how easy it could be for someone to quickly search and find something on your computer. Be careful!
  • Most of my other comments have are merged into Danny’s review.

Direct LINK to This ResourceShelf Post

It’s so much easier sometimes to the let the pros do the writing <wink>.

New Yahoo! Beta: My Yahoo! Search

Check out the latest beta from Yahoo!

From the site:

Personal Search – search for what you want, the way you want.

  • Save the results you like. more
  • Block the ones you don’t. more
  • Share what you’ve found. more
  • Find anything you’ve saved. more

Read more about it on John Battelle’s Searchblog, on Jeremy’s blog, at Search Engine Watch or directly on the Y!Search Blog.

Interesting article on metadata driven search and browse on an intranet

Metadata based search and browse functionality on the NSW Office of Fair Trading intranet: a Case Study

http://www.oict.nsw.gov.au/content/6.1.1.22.OFTINT.asp

interesting notes from the case study

An authority list is a list of the terms that can be used to populate a given metadata element

The simple search executes a search across the ’subject’, ‘title’ and ‘description’ metadata fields as well as the text of HTML pages and attachments. If the search term has a synonymous term within the corporate thesaurus then this term will also be searched.

Monitoring user searching and browsing success
Search reports are generated daily and cumulated on a monthly basis. These reports show the number of searches performed, the most popular search terms, search terms which resulted in success and those which resulted in failure. To date we only have reliable data for the last three months and are still looking at how to interpret these reports and how we can use them to develop intranet content and improve search functions.

interesting notes from the conclusion

Assigning metadata centrally, rather than decentralising the process, was the correct decision. It became obvious to us that two people assigning metadata frequently results in higher quality metadata than many people assigning metadata infrequently. The usefulness of the search and browsing functions depends totally on good metadata content, so no matter how good our standards and tools are, without proficient metadata assigners the project would have failed.

interesting notes from external resources

The guiding principle for using qualifiers with AGLS elements, colloquially known
as the ?dumb-down rule?, is that a client (eg a person or software) should be able to
ignore any qualifier and use the description (element content) as if it were
unqualified. The remaining element value without the qualifier should continue to
be generally correct and useful for discovery and other management purposes.
from http://www.naa.gov.au/recordkeeping/gov_online/agls/AGLS_reference_description.pdf

Categories Defined

The world I live in is full of information. I spend my days thinking, reading and talking. There is so much information — it’s ALL information — that handling it and making sense of it become the real challenges.

The projects I take at work, and the questions I choose to think about are generally extremely interconnected. When I’m thinking about XHTML markup, am I not thinking about the internet? How does the realization of something differ from the design of something? Does it? In the world of news web sites, how exactly is the Sports section different from the News section? News related to sports is still news, right?

Anyways, I write all this because I’m starting out with this new blog (thanks for visiting!), and deciding how to categorize it is both an important and a difficult question. Here are the categories I’m going to use initially. I suspect they’ll evolve over time, but this is my best first effort:

  • Accessibility
  • Back-end + PHP + MySQL
  • Blogging
  • Browsers
  • CSS techniques
  • DOM + Scripting
  • Design thinking
  • Events
  • Humor
  • Information design
  • Knowledge + Content Management
  • Life and such
  • Mark-up techniques
  • Other user agents
  • Photos
  • Politics + News
  • References
  • Rules-based design
  • Sandbox
  • Safe keeping
  • Search engines
  • Stuff + Things
  • The Internet
  • Tools + Software
  • Travel
  • Web standards
  • Yahoo!

I must give broad credit to those that came before me. I borrowed categories from D. Keith Robinson at Asterisk, Richard Rutter at clagnut and others. (Lots of the categories come from my blogging experience behind my company’s firewall.

For more information and thinking on categories and taxonomies, check out this article by Lars Marius Garshol: Metadata? Thesauri? Taxonomies? Topic Maps! Making sense of it all. I’ll probably point to this article again in the future, but it’s relevant to this post, so enjoy!

cheers, nate



San Francisco, California | Creative Commons By-2.5 License | Contact

RSS Feed. This blog is proudly powered by Wordpress and uses Modern Clix, a theme by Rodrigo Galindez.