Things you might find useful

Posts Tagged ‘Regular Expressions’

Filtering Out the Noise with Regex

Tuesday, July 13th, 2010

I rely heavily on regular expressions when processing old data. Like my post about cleaning phone numbers, I’ll demonstrate another example of how useful regex can be when dealing with unruly data.

Read the rest of this entry »

Tags: ,
Posted in Code | View Comments

IMDB Episode Scraper

Sunday, October 18th, 2009

I like to keep my TV rips well-organized, and labeled with the correct episode title. I’ve been using the excellent Bulk Rename Utility to quickly rename with proper series title, season and episode numbers—but the individual episode titles were always a pain. Begrudgingly, I’d just copy-paste the titles from Wikipedia, always wanting a way to automate that process.

Read the rest of this entry »

Tags: ,
Posted in Code | View Comments

Cleaning Phone Numbers with Regular Expressions

Sunday, August 30th, 2009

I’ve been in the position of having to take an unnormalized database that had virtually no data validation or standardization in place, and migrating it to a completely different, normalized schema. I wielded the power of regular expressions (regex) to help me through the process.

Read the rest of this entry »

Tags: ,
Posted in Code | View Comments