Searching Wikipedia Sucks!

Have you tried searching Wikipedia lately? Don’t bother, because you probably won’t find what you’re looking for! I am continually amazed at how terrible the Wikipedia search results are. Here’s an example of what I mean. Go to Wikipedia, type “al gor” in the search box, and click the search button. You should see something like this. That’s right, the top results are Al-Merrikh, Cy-Gor, Firouzabad, and Kagame Inter-Club Cup.

Absolutely terrible! If you type the same thing in the search box at Google, not only do you get accurate results, but Google prompts you with “Did you mean: al gore”. Why yes, I did! So why is searching Wikipedia so bad?

Part of the problem is that Wikipedia actually has two search modes: “Go” and “Search”. If you type “Al Gore” (spelled correctly) in the box and click Go, you’re taken right to the entry about Al Gore. If you instead click Search, you’re taken to a list of articles that contain or reference “Al Gore”. You can read more about searching Wikipedia here. So they’ve sort of complicated things by including two buttons instead of just one. The Go button is useful when you know the name of the article you want, but useless otherwise.

The other part of the problem is that the search algorithm just plain sucks. I know they don’t have a lot of resources, but you’d think that one of the most popular websites on the web could have a decent search feature. Matching “al gor” with “al gore” is a problem that has been solved for years, yet Wikipedia doesn’t even come close to accomplishing it!
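To show how little it takes to get the basic case right, here is a minimal sketch in Python using the standard library's difflib module. The list of titles is just the handful of names mentioned in this post, and the function name `suggest_title` and the 0.6 cutoff are my own assumptions. It is not how Wikipedia or Google actually do it, just an illustration of approximate string matching.

```python
import difflib

# Hypothetical sample of article titles, taken from the names in this post.
TITLES = ["Al-Merrikh", "Cy-Gor", "Firouzabad", "Kagame Inter-Club Cup", "Al Gore"]


def suggest_title(query, titles, cutoff=0.6):
    """Return the title most similar to the query, or None if nothing is close."""
    # Compare in lowercase so that "al gor" can match "Al Gore".
    lowered = {title.lower(): title for title in titles}
    matches = difflib.get_close_matches(query.lower(), list(lowered), n=1, cutoff=cutoff)
    return lowered[matches[0]] if matches else None


if __name__ == "__main__":
    print(suggest_title("al gor", TITLES))
```

A real search engine would also weigh popularity and query logs, but even this crude similarity score ranks the intended article ahead of the others in the sample list.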

Wikipedia itself mentions external search engines as a way to find what you’re looking for, but they aren’t really much better. For instance, if you type “al gor” at the special Google search for Wikipedia page, you do get the correct Al Gore entry as the first result, but the rest are not relevant at all.

So here’s where we’re at. Google knows that if you type “al gor” you probably mean “Al Gore”. Wikipedia knows about all of the entries that reference “Al Gore”. What we need is a way to combine the two! Is that really so much to ask?

If you know of a better way to search Wikipedia, please let me know!

JihadOnYou: Declare holy war!

I was reading Mashable today, and came across this post on a new website called JihadOnYou. Apparently the site was built over a single weekend – no word on how long it took them to come up with the name. Here’s the description from Mashable:

No matter what it is that has made your day a little bit more miserable, simply go to this site, rant about it, and “declare holy war” on it. Whether it be your annoying co-worker, an ex-girlfriend, the loaner car from the dealership, whatever it is, this is your place to rant. Other users then can rate your Jihad to decide if it’s worthy ala-Digg style.

Most of the comments at Mashable discuss the name, which could be described as offensive. To that I say bollocks!

If a word is “politically incorrect” or otherwise offensive, should you avoid it at all costs? My opinion is no. The word “jihad” will continue to carry the connotations it currently does only if we restrict its use. I don’t expect JihadOnYou to change the meaning of the word by itself, but every little bit helps. And yes, I realize that jihad is a word with a lot of history.

As for the site itself – it’s kinda neat! The about page says “we’re here to entertain, not educate” and to that end I think they have succeeded. It’s pretty hard to visit the site and not laugh!

Read: Mashable

NY Times article on Pownce made me laugh

After writing my review of Pownce a few weeks ago, I figured I’d never write about the site again. However, after reading an idiotic article published in the New York Times yesterday, I knew I’d have to. Author Jason Pontin had me shaking my head right from the opening paragraph:

JUST now, the hottest startup in Silicon Valley — minutely examined by bloggers, panted after by investors — is Pownce, but only a chosen few can try out its Web site.

Hottest startup in the valley? News to me. Maybe three or four weeks ago. Anyway, let’s continue.

Within days, invitations were selling on eBay for as much as $10. Mr. Rose has declined all requests to be interviewed about the service, including my own. But as a consolation, he sent me a coveted invitation. I enjoyed the rare thrill of cyberhipness — and got to experiment with the site.

Coveted? Are you kidding me? Pownce tells me I have nine invites to give out. I’ve had them for weeks. I am positive I’m not the only one. Sorry Jason, receiving an invite to Pownce is anything but a hip cyber experience.

After some general information and background on Kevin Rose, Jason concludes that media executives should keep an eye on Pownce:

What struck me most was the site’s potential to be powerfully disruptive. Most file-sharing occurs on public sites, which can be monitored by media companies; if the users violate copyrights, the sites or the users themselves can be threatened into compliance or litigated out of existence (as happened with the original Napster). File-sharing on Pownce would be difficult to police.

If I didn’t know any better I’d think Jason was trying to make a joke. Because I sure laughed.

The RIAA has sued children, senior citizens, and everyone in-between. They’ve shut down company after company, and they’ve successfully petitioned ISPs for records detailing the activities of their subscribers. Somehow I don’t think policing Pownce (a system which knows exactly who is sharing what with whom, btw) would be a problem. Evidently Jason hasn’t heard of BitTorrent, which actually does make it difficult to police file-sharing (especially with the recent work done on protocol encryption).

I really wish the NY Times would stop publishing useless fluff pieces like this one.

I should mention that my main criticism of Pownce is set to be remedied soon – they are starting an API. Should be available in September, though the undocumented API that their desktop app uses has already been, um, documented.

Read: NY Times

The Gatekeepers of Privacy

As you know, I don’t worry that much about online privacy. In fact, I think it’s a huge waste of time to be overly concerned about privacy on the web. I always keep two things in mind:

  1. There is no such thing as private information.
  2. If someone looks at information online and draws a negative impression about me, I have larger problems than privacy to worry about.

So far my strategy has been working fairly well. To my knowledge I haven’t missed out on any opportunities because of information about me found on the web – quite the opposite in fact.

For some reason though, I am fascinated by the worries and concerns of others when it comes to information privacy. And believe me, there are a lot of worriers out there. So many, it seems, that Global TV’s troubleshooter looked at the security of Facebook and other popular websites last night (unfortunately they haven’t fully embraced the new web, and the video is not available on their site).

They contacted a local “hacking” firm, and asked them to review Facebook, Gmail, and other popular sites. The gentleman they spoke to couldn’t have been more cliché – long hair, super geeky, could be mistaken for a girl, you know the type. Anyway, they apparently spent over 30 hours trying to “hack” into Facebook and couldn’t get in. I just shook my head through all of this. They deemed Facebook “very secure”. Well, problem solved I guess, haha!

Then they spoke to a professor from the UofA (if I remember correctly) who said that living under the assumption that your information is safe is a dangerous thing to do. Finally someone smart! The segment then ended with the anchors asking each other if they were on Facebook (they aren’t, unfortunately). Oh and the suggestion that you should read the privacy policy of every site you visit (yeah, cuz that’s going to happen).

It doesn’t matter how secure Facebook is. Privacy is not about technology. If someone wants to find out something about you, they will. Social engineering, dumpster diving, and many other techniques are far more effective than trying to hack into a site like Facebook. More importantly, there’s no need to – just create your own Facebook account! Chances are, the person you’re interested in hasn’t adjusted their privacy settings anyway.

For its part, Facebook follows two core principles:

  1. You should have control over your personal information.
  2. You should have access to the information others want to share.

A respectable policy, no doubt. Here’s the problem though. Let’s say I give access to certain information only to my brother. No one else (in theory) can see it, right? Wrong. I can give my brother access to the information, but I can’t restrict him from doing something with it.

Technology is just a tool. People are the gatekeepers of privacy.

Why is Facebook so addicting?

For those of you who use Facebook this will come as no surprise: I’m addicted. I don’t know what it is about the site, but something has me completely hooked. Lately when I think social networking, I think Facebook – it seems to me they have found the magic formula. And I really want to understand what that formula is.

Here are a few “magic ingredients” that I have come up with:

  • Human Connection. I think it’s human nature to want to be connected to other humans. Obviously, this is the core of Facebook’s product. Sure you can share links and write notes and such, but the core idea is connecting with other people, and everything seems to be designed with this in mind (you can tag people in photos, notes, etc.)
  • User Interface. With the exception of the ugly banner on the left side, the site is clean and the layout is mostly consistent. I think for the same reason people love Google’s simple front page, people love Facebook’s simple interface.
  • News Feed. Aside from being an efficient way to display information, the news feed makes logging into the site many times a day worthwhile. There’s always something new to see. Try to imagine Facebook without the news feed…it’s hard isn’t it? This is a key feature.
  • Almost Live Casual Communication. I think Facebook is great for communication that falls somewhere in between instant messaging and email. Like a simple “hey how’s it going” that doesn’t require an immediate response, nor an entire email message (which would appear in your inbox alongside important messages and spam). The wall is definitely another key feature.

When they first decided to open the site up to everyone, expanding away from their original audience of college students, I wasn’t sure if it would work out. I figured it might make Facebook seem less attractive. Turns out my suspicions were wrong. Facebook is definitely going mainstream.

I’ll think about this some more, but what do you think – why is Facebook so addicting?

Oh, and if we’re not friends on Facebook yet, add me! Here’s my profile.

Clean & Hackable URLs

A week ago, Roland Tanglao reiterated his love for clean URLs. Or perhaps more accurately, his hatred of dirty (?) URLs. Here’s what he wrote:

URLs with question marks, ampersands, etc should be banished to the Web 1.0 h*ll where they belong. I’ve been preaching the clean URL gospel for years but if I see one more WordPress blog with “?p” or one more Drupal site with “?q”, I’ll scream :-) Seriously if your webhost or your tech gal/guy can’t figure out how to use clean URLs, find somebody else. It’s 2007!

I couldn’t agree more. Here’s an example of what he means:

Dirty: http://example.com/articles.html?articleid=123&tag=rss
Clean: http://example.com/articles/123/rss

Clearly I prefer the second one, and I’m guessing you do too. I’m going to go one step further though, and say that not only should URLs be clean, they should be hackable! What does that mean? Let me give you an example:

http://mastermaq.podcastspot.com/episodes/FF7962/license
http://mastermaq.podcastspot.com/episodes/FF7962
http://mastermaq.podcastspot.com/episodes
http://mastermaq.podcastspot.com

The first link is for the licensing information of an episode. All you’ve got to do is “hack” off the end and you get the episode itself. One more hack and you get all the episodes. And finally, you’re left with the entire podcast. It’s pretty logical right? And it would be trivial to replace the episode ID with another one, or /episodes with /tags, etc. That’s what I mean by hackable – they are easily modified to get you where you want to go.
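The "hacking" can even be done mechanically. Here is a minimal sketch in Python, using only the standard library, that chops one path segment off at a time. The function name `hack_back` is made up for this illustration, and it assumes the URL has no query string worth keeping.

```python
from urllib.parse import urlsplit, urlunsplit


def hack_back(url):
    """Return the URL followed by each parent you get by removing the last path segment."""
    parts = urlsplit(url)
    # Ignore empty segments produced by leading or trailing slashes.
    segments = [segment for segment in parts.path.split("/") if segment]
    urls = []
    for end in range(len(segments), -1, -1):
        path = "/" + "/".join(segments[:end]) if end else ""
        # Query string and fragment are dropped on purpose in this sketch.
        urls.append(urlunsplit((parts.scheme, parts.netloc, path, "", "")))
    return urls


if __name__ == "__main__":
    for parent in hack_back("http://mastermaq.podcastspot.com/episodes/FF7962/license"):
        print(parent)
```

Run against the license link, it walks through the same four URLs listed above. The point is that every intermediate URL is one a visitor could reasonably expect to work.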

Here’s another example:

http://mastermaq.podcastspot.com/episodes/archive/2007/02/24

That will show you all episodes for February 24th, 2007. The URL is readable, and immediately you understand what it is doing. What if you want a different day? Replace 24 with something else. Just the month? Hack off the 24. You get the idea.
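On the server side, supporting this kind of URL mostly means making the trailing parts optional. The following is a rough sketch in Python of how a site might parse such a path. It is not how Podcast Spot is actually implemented, and the pattern, the `parse_archive` name, and the returned dictionary are all assumptions made for illustration.

```python
import re

# Year, month, and day are each optional, but only from the right-hand side:
# you can hack off the day, then the month, then the year.
ARCHIVE_PATTERN = re.compile(
    r"^/episodes/archive(?:/(\d{4})(?:/(\d{2})(?:/(\d{2}))?)?)?/?$"
)


def parse_archive(path):
    """Return the date parts found in an archive path, or None if it is not one."""
    match = ARCHIVE_PATTERN.match(path)
    if match is None:
        return None
    year, month, day = match.groups()
    named = (("year", year), ("month", month), ("day", day))
    # Keep only the parts that were actually present in the URL.
    return {name: int(value) for name, value in named if value is not None}


if __name__ == "__main__":
    print(parse_archive("/episodes/archive/2007/02/24"))
    print(parse_archive("/episodes/archive/2007/02"))
```

Each shorter URL simply produces a broader filter, which is what makes hacking off the end feel predictable. Note that this sketch does not check that the date is a real calendar date.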

Clearly I am drinking the clean & hackable URLs koolaid, and as a result Podcast Spot has nothing but clean, hackable URLs. If you’re working on a web project, consider doing the same – your users will thank you for it.

Happy Birthday Yahoo!

On March 2nd, 1995 the site that started life as “Jerry’s Guide to the World Wide Web” incorporated as Yahoo (with an exclamation of course). I remember the early days, when all the pages had grey backgrounds and seemed to lack structure. It sure has come a long way. Tony Long at Wired explains:

Originally founded as a search engine/web directory, the company expanded rapidly through acquisitions to diversify into a full-blown internet service company, offering e-mail, instant messaging, social networking, online shopping and news, among other things.

I like Yahoo!, in case you hadn’t noticed, despite their growing pains.

And here’s a cool bit of trivia I just read on Wikipedia: if you click the exclamation point in the Yahoo logo on the homepage, you’ll hear the Yahoo yodel!

Read: Wired

A Rant About MySpace

I hate MySpace. I simply cannot stand it. The navigation is horrible. The design is ugly. Their URLs are the most unfriendly ever. Random people add me to their “friends” list. Users have too much control over the look of the pages…which usually means that they end up making the pages painful to look at. Dancing text, repeating background images that were never meant to repeat, music that starts playing automatically, etc. I really cannot fathom how so many millions of people use MySpace on a daily basis.

Quite possibly the only thing I like about MySpace is that it runs on .NET and is therefore an excellent case study/example. But that would be the only reason.

Every single time I look at MySpace I cringe. Maybe I just don’t get it?

Wired News gets Odeo all wrong

I think the staff at Wired News must have missed the memo about Odeo. In a list of Web 2.0 Winners and Losers published today, they included Odeo on the winners list. They praised the service, saying that Odeo “breezed in and de-mystified the podcast.” Huh, is that really what happened?

Not according to Odeo co-founder Evan Williams, who when giving a talk last week said Odeo failed for five main reasons:

  • “Trying to build too much”
  • “Not building for people like ourselves”
  • “Not adjusting fast enough”
  • “Raising too much money too early”
  • “Not listening to my gut”

De-mystified the podcast? That would explain why the vast majority of the population doesn’t know what a podcast is. They certainly know what MySpace or YouTube is though, yet MySpace appears on Wired’s losers list.

In some ways, the list that was voted on by Wired News readers is much more accurate – Odeo doesn’t appear on either list. This is the wisdom of the crowd at work! I don’t think they can be described as winners or losers yet, because Odeo seems to be finding their way still. I am willing to give them the benefit of the doubt, to wait and see if they can turn it around.

The funniest part of the Wired article is this:

In the interest of brevity, I’ve chosen five sites from each category. The web services industry certainly has more than five winners and five losers, so we’ve only highlighted the exemplars.

I’m not exactly sure what reporter Michael Calore considers the definition of “exemplary” to be, but I am quite certain it’s different from my definition. And probably different than the dictionary’s definition too. The first five that came to mind for me certainly didn’t include Writely or Odeo (mine would be Flickr, del.icio.us, YouTube, MySpace, and digg).

Read: Wired News

Apple Podcasting Site Broken!

The new nanos are great, Apple still rules digital music with the iPod and all that, but they’ve broken podcasting. Well, they’ve broken their own podcasting site anyway. I went to look at the iTunes Podcasting spec, and noticed that the page can no longer be found! Seems the redesign for the new stuff broke the website. Well done Apple!

And it’s a shame too, because http://www.apple.com/podcasting was such a nice URL, wouldn’t you say?

A search for podcasting on the support site only gives the Podcasting FAQ. And the link on that page to the podcasting page remains broken. Fortunately, Google comes to the rescue. You can see cached versions of the podcasting page and the tech specs.

Maybe they are going to be updating the spec?