I rely heavily on regular expressions when processing old data. As in my post about cleaning phone numbers, I’ll demonstrate another example of how useful regex can be when dealing with unruly data.
Posts Tagged ‘Regular Expressions’
IMDB Episode Scraper
Sunday, October 18th, 2009
I like to keep my TV rips well organized and labeled with the correct episode title. I’ve been using the excellent Bulk Rename Utility to quickly rename files with the proper series title, season, and episode numbers, but the individual episode titles were always a pain. Begrudgingly, I’d just copy and paste the titles from Wikipedia, always wanting a way to automate the process.
Cleaning Phone Numbers with Regular Expressions
Sunday, August 30th, 2009
I’ve been in the position of having to take an unnormalized database that had virtually no data validation or standardization in place and migrate it to a completely different, normalized schema. I wielded the power of regular expressions (regex) to help me through the process.
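To give a flavor of that kind of cleanup, here is a minimal sketch in Python. It is not the code from the original migration: the function name `normalize_phone`, the 10-digit US number assumption, and the `NNN-NNN-NNNN` target format are all hypothetical choices made for illustration.

```python
import re

def normalize_phone(raw):
    """Return a US phone number as NNN-NNN-NNNN, or None if it can't be parsed.

    Assumption (not from the original post): numbers are 10-digit US numbers,
    optionally prefixed with the country code 1.
    """
    # Strip everything that is not a digit: spaces, dots, dashes, parentheses.
    digits = re.sub(r"\D", "", raw)

    # Drop a leading country code so "1 (555) 123-4567" matches "555-123-4567".
    if len(digits) == 11 and digits.startswith("1"):
        digits = digits[1:]

    # Anything that is not exactly 10 digits is flagged for manual review.
    if len(digits) != 10:
        return None

    return f"{digits[:3]}-{digits[3:6]}-{digits[6:]}"


if __name__ == "__main__":
    for sample in ["(555) 123-4567", "1.555.123.4567", "123-4567"]:
        print(repr(sample), "->", normalize_phone(sample))
```

Returning `None` rather than guessing keeps the bad rows visible, so they can be reviewed by hand instead of being silently written into the new schema.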