Tagging Test

Most of these pages happened to be open in my Firefox browser. I took the URL and plugged it into http://del.icio.us/url to get their tag cloud. It didn't take long for me to discover that many pages are not specifically tagged but a few directories up, are tagged. For example, http://netatalk.sourceforge.net/2.0/htmldocs/installation.html#id2837734 no one has tagged, http://netatalk.sourceforge.net/2.0/htmldocs/ has been tagged by 12 people and http://netatalk.sourceforge.net/ has been tagged by 131 people. If I simplify the URL it will appear below the original URL. Interestingly, according to this article

"…a study of the del.icio.us website conducted by the Information Dynamics Lab at HP Labs, a stable tag pattern emerges after the first one hundred bookmarks are placed for a particular website. They attribute this synchronization to user imitation of popular tags and to a common knowledge base shared by users of the site. As a result, alternative views exist alongside popular ones without being disruptive to the pattern."

This will have to be tested further.

Of the 7 unique sites, 5 had a common tag of 'netatalk', 4 of 'linux', 3 of 'ubuntu', 2 of 'apple' and 'mac', etc. Seems to work fairly well even with limited tags. All of these links are related except for the last one. An algorithm would have to be created which looked at the main link, a simplified link and keywords/title to figure out where pages with minimal tagging belong as well as which tags are important relating to previously opened pages.

URL # of People Common Tags
http://netatalk.sourceforge.net/2.0/htmldocs/installation.html#id2837734 None None
http://netatalk.sourceforge.net/2.0/htmldocs/ 12 7-linux 5-reference 4-mac 4-networking 4-software 3-netatalk 3-network 2-apple 2-HOWTO 2-OSX 2-system:unfiled 2-unix
http://netatalk.sourceforge.net 131 79-linux 72-mac 54-apple 53-netatalk 42-network 41-software 39-osx 23-afp 20-unix 14-appletalk 14-networking 14-opensource 12-Server 9-bsd 9-storage 8-computer 6-Filesystem 5-free 4-apps 4-computers 4-fileserver 4-macosx 3-macintosh 3-sharing 3-technology
http://www.disgruntled-dutch.com/2007/general/how-to-get... 15 13-linux 10-netatalk 8-afp 8-OSX 7-howto 6-mac 4-apple 3-debian
http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=191790 1 debian sysadmin netatalk package ssl
http://bugs.debian.org/ 5 4-debian debbugs linux irc
http://ubuntuforums.org/showthread.php?t=726502 none none
http://ubuntuforums.org/showthread.php 4 3-linux oracle dns projects ubuntu
http://ubuntuforums.org/ 2147 336-ubuntu 272-linux 176-Forum 123-Forums 91-community 86-support 79-opensource 47-howto 24-computer 14-computers (only the top ten)
http://netatalk.sourceforge.net/wiki/index.php/How... 1 netatalk ubuntu
http://ubuntuforums.org/showthread.php?t=347019 6 5-netatalk 5-ubuntu 4-howto 3-avahi 3-tutorial 2-leopard
http://www.iua.upf.es/~taussenac/wahwactor.htm 5 3-music sound-fx freeware tools audio plugin vst guitar wahwah guitareffect
page_revision: 15, last_edited: 1209402386|%e %b %Y, %H:%M %Z (%O ago)
Unless otherwise stated, the content of this page is licensed under Creative Commons Attribution-ShareAlike 3.0 License