Seesmic numbers via the API 22-Feb-08
Posted by laza in Misc, Smalltalk.Tags: API, seesmic, statistics
trackback
Lately there has been a thread on seesmic about numbers in and around seesmic. There is also seesmic café episode 85 which presents additional numbers. I had some fun using the seesmic API and providing even more numbers.
Looking at about 40.000 of the latest videos posted on seesmic (during about a month):
English is the seesmic language
All in all ten different languages have been used. Not specified are all those videos from YouTube or qik. English dominates with 78%. This won’t come as a real surprise. Also not French being on the second place with 16%, because of the origin of the seesmic founders. Spanish is on the third place with under 1.5%. So far I think the order matches the overall distribution of spoken languages. Now let’s have a closer look at the remaining languages below 1%.
72 out of 40000 (0.18%) had been recorded in german followed by 43 italian, 31 japanese, 15 dutch, 13 chinese, 4 russian and 3 korean.
Thursday is the seesmic day
People don’t seem to like mondays.
Freida is the top seesmic user
The 40000 videos were posted by 1280 different users. Freida was the single user that posted the most videos, namely 1562. That are about 3-4 videos per hour constantly over a period of a month (assuming 8 hours of sleep per day).
Now how hard was it to get those numbers. It was very straight forward to have some implementation in Smalltalk to use the seesmic API. As an example the following snippet fetches information about the latest 50 videos on seesmic:
| session videos languages |
session := SMICSession username: ‘user’ password: ‘pwd’.
videos := session videolistMax: 50.
Doing the statistics is also very easy in Smalltalk:
"include above code here"
languages := Bag new.
videos do: [:video | languages add: video language].
Transcript show: languages sortedCounts.
Now instead of just showing the numbers of language distribution on the Transcript it is also easy to generate a URL that uses the Google graph API and produces above nice graphs.

Veddy veddy interesting… very cool too, nice work!
Wow, that’s a great test of the API - thanks for sharing your work.
Wow - did you give the API a test run or what!? That’s incredible - would be pretty cool if you could cache/save the values on a nightly basis and keep up to date stats.
Now there is also a little animation of the top 10 graph
http://youtube.com/watch?v=Aaj5011jLls
Thanks for sharing the numbers. I hope you had fun! I am always fascinated at people fascinated with numbers. Apparently, I am more fascinated at talking as the numbers show it…
Otir, thanks for your comment. I was playing with the Seesmic API and gathering some data. Doing some basic statistics was an easy thing once the data was there. And this did match nicely with the numbers thread on seesmic.
BTW, I would have explained this on seesmic, but unfortunately my camera does not work with the client. Need to change that…