Thoughts on my first visit to CHI

I recently returned from Florence, Italy, where I made my first visit to the CHI conference. I was struck by a number of things:

  • Size. There were about 2,500 people at CHI.  I’d never been to a conference larger than 200 or 250 people.  I did not like that I could often look around and see nobody that I recognized.  I liked that the size brought lots of non-academics, lots of fun stuff to do, and lots of options for things to see and do.
  • Culture. I work in a computer science department; I’m used to feeling like I barely belong in the discipline (I am interested in the applications of computing and the impact of computing on people). CHI presented the opposite feeling: I felt like an uber-geek. The culture was: design, art, experiment, evaluation, psychology, sociology.  Strangely absent was much talk of things like algorithms or architectures.
  • Breadth. Past conferences I had attended had revolved around the presentation of peer-reviewed research papers.  Not true at CHI, where there were lots of alternatives: pseudo-research (e.g. “works in progress”, design competitions, etc.), interactive demos, panels, …  One pretty good feature of the conference was “CHI madness”, where presenters gave 30 second overviews of their work.  Very well orchestrated, and often fun to watch.

By the way, I presented a paper (Predictors of Answer Quality in Online Q&A Sites). It was a good experience. People seemed uniformly interested in the topic, and seemed excited to talk about what we learned and our plans for future work. I learned of other researchers who have started working in the Q&A area.  I also got moderately motivated and inspired.

Max

Traffic Report on Q&A Sites

Hitwise published a report on traffic to Q&A sites, which has apparently gone up 889% (!) over the past two years.

Basically, Yahoo! Answers dominates U.S. traffic, getting 74% of all hits categorized as Q&A. I wonder how closely correlated traffic volume is with the volume of contributions (questions asked, and answers provided).

The report also shares some demographics, which is nice. Slight bias towards women, slightly older population (mode 30s and 40s). As I’ve argued before, I think Q&A sites are overlooked by technologists and researchers because of this demographic.

Max

More reading:

Q&A Still Overlooked

For some reason, Q&A sites are mostly overlooked by analysts and journalists. One recent exception was an article from Slate, comparing Yahoo! Answers (unfavorably) with Wikipedia. A good read, if you’re interested in Q&A, social search, or mass collaboration web sites.

This morning, though, while catching up on my Technology Review RSS feed, I saw some analysis of Microsoft’s move to acquire Yahoo! A quoted analyst observes that this acquisition isn’t just about search, but also about Yahoo’s social properties, some of which are best in class. The interesting fact is that the two properties cited are Flickr and del.icio.us. Yes, important sites. I use both of them. What about Yahoo! Answers? Check this out:

Yahoo! Answers gets about as much traffic as Flickr, and about an order of magnitude more traffic than delicious. Also, note that while Flickr is a leader in the field (although Photobucket, Smugmug, Google’s Picasa, and Kodak’s offerings are strong competitors), Yahoo! Answers blows the rest of the Q&A field away in terms of activity. There is no other Q&A site in the US that comes close.

So, the search engine corporations know about the importance of Q&A sites. But the press and technology analysts still do not, apparently. (Nor do most researchers…that’s a different story)

I will speculate that the reason for ignoring Q&A sites to date has to do with the demographics of their users. While Flickr and Delicious are founded on relatively tech-saavy, relatively geeky users (i.e. the same demographic as technology writers), Q&A sites are often frequented by high-schoolers, stay-at-home moms, and other demographics that are typically ignored by the technology press.

What do you think? What are other reasons for ignoring Q&A sites?

Max

Activity-Balanced Clustering at RecSys 2007

Shilad, Dan, and I wrote a short paper called “Supporting Social Recommendations with Activity-Balanced Clustering“.  I presented the paper at Recommender Systems 2007 here in Minneapolis.  The idea is that standard clustering algorithms such as k-means simply don’t work well if you’re trying to create groups of users that manifest similar levels of activity.  In MovieLens, k-means placed 84% of the active users in a single cluster.  Thus, we mashed up some techniques from the clustering literature to create an algorithm that improved on k-means in terms of cluster quality and computational complexity, while giving us the balanced properties we sought.  One of the key insights is that stable matching algorithms are a clever way to evenly divide sets of things based on similarity metrics.

I received lots of feedback from conference attendees.  One of the most interesting questions (that I have not investigated) is whether there are some unique properties of the users that I clustered that led to the severe lack of balance with k-means.  I don’t think so - the 20,000 or so MovieLens users that we used as our data have two characteristics that seem particularly regular:

  • Power law activity
  • Normal distribution of ratings (1-5 scale)

Another person suggested that I simply didn’t tweak k-means well enough (and the other algorithms I tried).  Well, I turned plenty of knobs, many times.  In the process of the knob turning, I realized that clustering metrics are a fairly bad way of gauging the “feel” of a cluster of users.  I’d like to dig more into clustering metrics that reflect the different ways that groups of people might come together.

Max

Summer 2007 At Nokia Research

I recently returned from a very interesting summer at Nokia Research Center, Palo Alto. This lab is very young - it launched late last year - which created an atmosphere of exploration, and perhaps uncertainty. As an intern, this was a fun opportunity, as the lab culture (and especially my mentor, Joe McCarthy) encouraged original thinking and novel system building.

The project that I worked on is well summarized by Joe on his blog. It was a very fun project: lots of coding and lots of playfulness.

Some pictures: http://www.flickr.com/photos/maxharp3r/tags/nokia/

Max

A New Web Page

This post signals the launch of my new vanity page - hosted by a blogging service! It’s fabulous. A wysiwig editor lets me format text, a designer crafted the CSS, and all of the content is available via RSS. Plus, I can link this page to the blog posts that I write for the GroupLens Blog and the blog posts that I share on Google Reader.

I’ll try to keep you informed as to my whereabouts and research projects. Thanks for visiting!

Max