“Have you ever wondered what language a blog entry you glanced at might be in?” was the question I set out to work on more than two years ago, if memory serves me right. I always get curious when I see a blog post in an unknown language. I mean not just a language I don’t speak – a language I can’t identify.
I thought it would be a really hard problem to solve – writing a piece of software that could figure that out. It turned out not to be so hard though. Just hours of programming, and probably a lot of luck. Because my initial hunches on how to tune the algorithms proved to be pretty right, and I was, and still am, really startled at how good the software became.
I released it as the Wørd – Language Analyzer Opera widget. Unfortunately the target audience for Opera widgets is quite small, so I always thought of making it into a web page. I don’t know why it took so long, but here it is!
The web page version has some new, cool improvements. It will try to detect as you’re typing, for instance. It also has improved support for Swedish, Serbian, and Afrikaans. And the UI is in my opinion better than the widget version.
So please try it yourself and see how it works. It’s pretty fun to just copy-paste any piece of text your can find on the Internet into it, or just type something in a language you know yourself and see when it gets it right. Here’s the address again: