Add user pseudonyms in data analysis

When analyzing anonymous user data in a team, I often take an extra step to help discussion: converting user identifiers to popular English name pseudonyms.

Pseudonyms tend to make the data more welcoming to team members that aren’t working directly with it, and helps you follow trends and outliers. It also helps in your visual sanity checks during analysis: names are simply easier to remember, thus helping you spot problems when inspecting the data.

Popular baby names are readily provided by the Social Security office, and I usually keep a derivative text list handy. In the simplest case, you can simply convert each unique id into a name. When I want to safeguard against name assignments changing as the data changes, I’ll save the ID>Name conversions in a basic CSV.

Below is a very basic example written in R to show how easy it is to do:

Low-Effort Crowdsourcing

Sentence generation with choice-based typing. The program prompts a user to choose one of two words that are likely to come after the previous words, allowing them to generate a whole sentence by low-effort interaction.—programmed by Jeff Bigham

How small can a crowdsourcing contribution be?

At November’s CrowdCamp workshop, a group of us got together and prototyped a number of sample systems to see how low-effort crowdsourcing would work. We posted a report at Follow the Crowd.

Our prototypes were silly at times, but helped us think about the mixture of low-effort input methods and non-distracting user contexts where low-effort crowdsourcing would work.

The ideas we prototyped, available at Github, include:

  • A binary tweeting interface, that lets you type sentences using a choice between common words
  • A passive image voting interface that captures a user’s smile as a ‘like’
  • A browser extension proof-of-concept that lets a worker complete tasks while a page is load
  • A hot-or-not style interface for choosing the better of two choices. The twist is that you’re choosing using affirmative grunts, so you can play it while listening (or pretending to listen?) to somebody!

Uh-huh. Yeah.

The emotive voting interface ‘likes’ an image if you smile while the image is on the screen, and ‘dislikes’ if you frown.

Details at Follow the Crowd. Team was Jeff Bigham, Kotaro Hara, Rajan Vaish,  Haoqi Zhang, and myself.

Stuff-getting and People-joining

Based on an XKCD comic, the Up-Goer Five text editor only lets you write using the one thousand most common words in English. Here are my attempts to describe what I do in crowdsourcing and information retrieval using only common words.

Information Retrieval

How do you find something from a lot of written stuff? If there are hundreds of things or more, you can’t look at all of them! One way we can find things is to use the words to understand the ideas. Then, when you search with a question, we can find the ideas that you are asking about and find the written things that answer your question. However, words and ideas aren’t exactly the same thing, so we look for ways to make better how a computer understands the ideas in your question and in the stuff you’re looking through.

When people get together on computers, they make fun, cool, and strange things. My job is to understand why they do it, and how we can can work together to fix problems in the same ways.

Progress Bar Timer


A general purpose productivity tracker

I just published Progress Bar Timer in the Chrome Web Store. It lets you set up general purpose trackers in the form of progress bars. There are counters, timers, and clocks. Code is at Github, so feel free to submit bug reports and suggestions.

The application was designed toward my productivity habits but – spurred by the sense of the public eye on Github – I’ve tried to make it useful to others. My favorite use has been to combine a counter and a clock side-by-side. For example, during my field exam, I maintained a bar of word count progress alongside a bar showing where in the two week writing period I was.

Progress Bar

Continue reading