Ryan
Court Jester
I mentioned this in another thread, but someone asked about the Sunset Limited into NOL, so I figured that it was worth a new thread.
Over the last few days I've been playing around with learning Python. As a project, I've set after taking the text files that Amtrak publishes with the train status for a day (one displayed below), parsing the times into a useful format and then storing the data such that I can manipulate, analyze and plot it.
It's been a cool learning project, and it wouldn't have been possible without John Bobinyec's Amtrak Maps site, which archives and makes available all of the text files.
There's an Amtrak Delay's site, but you can only look back 4 weeks at a time - once I'm done, I'll have the status of a train from 2006-2012 (and later, as John makes the text files available).
Here's what the Sunset Limited's arrival times into New Orleans look like. I shifted the data series when the schedule change took effect. It didn't do much to timekeeping until the early September timeframe, when something happened to make it significantly more better.
In addition to graphs, I can also pull stats out - for example in the other thread, I was able to show that the Palmetto arrived too late for the guy to make his bus connection less than 1% of the time.
I haven't worked it through yet, but I'd also like to put together plots where delays are plotted along the stations on a route on the x axis - by plotting more than one day's train at a time, eventually I'll end up with a spaghetti plot that may show some interesting trends. I'm thinking the first train I'm going to look at is the Cardinal and see just how bad the BBRR looks.
If there are other trains and connection's people are interested in, I'm happy to take a look. This isn't probably something that I'll ever have polished enough to turn into something for public use (I'm sure that anyone that has any formal training in programming would look at my mess of code and recoil in horror), and each step of the way I'm increasing the automation and error handling to make things go a little faster (I probably spent about 2 hours transitioning from looking at the Palmetto to the Sunset Limited).
So, what else do you think would be interesting to look at?
Over the last few days I've been playing around with learning Python. As a project, I've set after taking the text files that Amtrak publishes with the train status for a day (one displayed below), parsing the times into a useful format and then storing the data such that I can manipulate, analyze and plot it.
It's been a cool learning project, and it wouldn't have been possible without John Bobinyec's Amtrak Maps site, which archives and makes available all of the text files.
There's an Amtrak Delay's site, but you can only look back 4 weeks at a time - once I'm done, I'll have the status of a train from 2006-2012 (and later, as John makes the text files available).
Here's what the Sunset Limited's arrival times into New Orleans look like. I shifted the data series when the schedule change took effect. It didn't do much to timekeeping until the early September timeframe, when something happened to make it significantly more better.
In addition to graphs, I can also pull stats out - for example in the other thread, I was able to show that the Palmetto arrived too late for the guy to make his bus connection less than 1% of the time.
I haven't worked it through yet, but I'd also like to put together plots where delays are plotted along the stations on a route on the x axis - by plotting more than one day's train at a time, eventually I'll end up with a spaghetti plot that may show some interesting trends. I'm thinking the first train I'm going to look at is the Cardinal and see just how bad the BBRR looks.
If there are other trains and connection's people are interested in, I'm happy to take a look. This isn't probably something that I'll ever have polished enough to turn into something for public use (I'm sure that anyone that has any formal training in programming would look at my mess of code and recoil in horror), and each step of the way I'm increasing the automation and error handling to make things go a little faster (I probably spent about 2 hours transitioning from looking at the Palmetto to the Sunset Limited).
So, what else do you think would be interesting to look at?