IRC log analysis for everyone!

Phil Bull philbull at gmail.com
Fri Jul 24 12:01:06 UTC 2009


Hi guys,

I finally got around to writing a half-decent* version of that IRC log
analysis script I was talking about a few weeks ago. It's pretty simple,
but it should be enough to get you up and running.

The script downloads the IRC logs for #ubuntu (or some other channel, if
you modify the script) and adds them to a local sqlite database. You can
then perform SQL queries on the database and mine the logs for common
questions and other useful information.

PLEASE only use this script if you're actually going to use the data for
something. The logs for #ubuntu for 2009 alone are about 200MB, and I
don't want to DOS irclogs.ubuntu.com by having loads of people
downloading unnecessarily.

I've provided a tarball here [1]. Patches and suggestions are welcome;
let me know if there are any bugs that you can't fix yourself, or if
you're struggling with the programming. If you find something
interesting, don't forget to share it with the list either!

Thanks,

Phil

[1] - http://philbull.googlepages.com/uirclog-2009-07-24.tar.gz

* As you can see from the code, half-decent is probably an
overstatement.

-- 
Phil Bull
https://launchpad.net/people/philbull





More information about the ubuntu-doc mailing list