More permanent stuff at http://www.dusbabek.org/~garyd

07 June 2004

FreeDb and Genres

Ok. I've been working on a tool that takes the flat file database available at freedb.org and puts it into a relational structure that can then be used by other tools for ID3 rectification.

The problem I've run into is one of data integrity. While I develop and test the software, I've only loaded a small subset of data into the database, namely that labled as "country" by the folks at freedb.

Well, I did a "select distinct genre from album" and came up with 438 different genres--all from the "country" database load.

Hmm.

This includes such genres as:
Country
Country (with extra spaces)
Countri (ok, I can accept french entries)
Counrty
Counntry
Coumtry
Coountry
Country & Western
COUNTRY

There are some other nice ones like:
Alt. Country
Alt Country
Alt-Country
Alt-country
Altern. Country
Altern.Country
Alternatif/Country
Alternative Country

The tool will only be as good as the data. So the first software I've created after the flat file loader is a maintenance tool to fix bad data. Eventually, when the software is in a usable state, I hope to post it to the website to see what others think about it.

0 comments: