Open stucka opened 11 months ago
Very interesting idea. Love the concept. It would help with searching for delays on these social networks.
I have two primary concerns:
LAX delay, we will have a bunch of identical messages about hub airport airlines. It's mixing very dynamic data from the FAA with very static data about airline hub operations.
So sorry for the long message. Mainly just thinking out loud here.
In short, I really love this idea. I'm just trying to think through all the options and such to ensure it's the best it can possibly be.
What are your thoughts @stucka?
Thoughts:
LAX delays may affect hub operations for as many as seven airlines: AAY ASA AAL DAL JBU SWA UAL
That's still 96 characters. Could be even shorter, like "7 hubs may be at risk: AAY ASA AAL DAL JBU SWA UAL".
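A minimal sketch of picking between the long and short wordings based on how much room is left in the post. The function name `hub_message` and the `limit` parameter are hypothetical, and the count is written as a digit rather than spelled out; the real character budget depends on the platform and on what else the bot puts in the post.

```python
def hub_message(airport, icao_codes, limit):
    """Return the most descriptive wording that fits in `limit` characters.

    `airport` is the airport code (e.g. "LAX"), `icao_codes` the airline
    ICAO codes with hub operations there. Returns None if nothing fits.
    """
    codes = " ".join(icao_codes)
    count = len(icao_codes)
    # Ordered from most to least descriptive; wording follows the thread.
    candidates = [
        f"{airport} delays may affect hub operations for as many as "
        f"{count} airlines: {codes}",
        f"{count} hubs may be at risk: {codes}",
    ]
    for text in candidates:
        if len(text) <= limit:
            return text
    return None  # caller can skip the hub line entirely


codes = ["AAY", "ASA", "AAL", "DAL", "JBU", "SWA", "UAL"]
print(hub_message("LAX", codes, limit=100))
print(hub_message("LAX", codes, limit=60))
```

Returning `None` when nothing fits leaves the decision to drop the hub line with the caller, so the delay post itself never gets truncated.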
I don't know what the data quality is on the Wikipedia page. Could certainly scrape it and turn it into something for Wikidata, no worries. I am not finding a more open data source than Wikipedia.
I'm not especially worried about using Wikipedia as a source on this, and I don't think the data would go stale particularly quickly or anything. Several of these airlines even dispute they have hubs vs. "focus cities" so there's certainly some ambiguity built into this anyway. Fuzzy data leads to judgment calls.
If you're pondering a web element, I think it'd depend on what you want to put into it and how dynamic it would need to be. If you just want more, it could be something as basic as a shortened URL pointing to a Google Doc with links to those FlightAware pages and descriptions of how to read these reports or whatever you want. Or that could be, like, your pinned Tweet-like-post.
@sandrafish is already vowing to buy me a beer for getting me roped into this LOL
> LAX delays may affect hub operations for as many as seven airlines: AAY ASA AAL DAL JBU SWA UAL
My question here is how many users actually understand those abbreviations. On some level it's not about the number of characters, it's about the value of the message. And I think for some users this would be harder to understand and extract value from.
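One way to make the codes readable without losing them for search is to pair each with a short airline name. A rough sketch, assuming a hand-maintained lookup; `AIRLINE_NAMES` and `describe_airlines` are hypothetical names, and unknown codes fall back to the bare code.

```python
# Short display names for the ICAO codes mentioned in this thread.
# In practice this table would come from whatever hub data source is chosen.
AIRLINE_NAMES = {
    "AAY": "Allegiant",
    "ASA": "Alaska",
    "AAL": "American",
    "DAL": "Delta",
    "JBU": "JetBlue",
    "SWA": "Southwest",
    "UAL": "United",
}


def describe_airlines(icao_codes, names=AIRLINE_NAMES):
    """Render codes as 'Name (CODE)', keeping the bare code if unknown."""
    parts = []
    for code in icao_codes:
        name = names.get(code)
        parts.append(f"{name} ({code})" if name else code)
    return ", ".join(parts)


print(describe_airlines(["AAL", "DAL", "UAL"]))
```

The trade-off is length: the paired form is easier to read but uses far more characters per airline than the codes alone.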
> I don't know what the data quality is on the Wikipedia page. Could certainly scrape it and turn it into something for Wikidata, no worries. I am not finding a more open data source than Wikipedia.
Actually, a lot of Wikidata sources are derived from Wikipedia. Wikidata just makes it easier to work with since it has an API, so you don't have to build a scraper.
I think one other consideration here is doing some research into whether Bluesky & Mastodon support creating a reply post that doesn't show up on users' timelines, but does show up on the post detail & search pages.
AFAIK that doesn't exist today. But might be worth looking into a bit more.
I think at the very least, this data would be useful, regardless of how we decide to integrate it into the bot itself.
Wikipedia maintains a list of hub airports in the U.S. here: https://en.wikipedia.org/wiki/List_of_hub_airports#United_States
If it'd help and there's interest I could maybe build a scraper for that and linked subpages to periodically update a list of airline names and ICAO codes, e.g.:
Perhaps followed by something like:
Alternately:
That could have some value in searches, with someone looking for "lax delay american airlines" or "dal 1234 delay".
A lower lift might be to simply write words like "Large airport delay!" where the CSV's "type" field is "large_airport" ... ?
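A rough sketch of that lower-lift option, assuming the CSV has a "type" column as described plus some column holding the airport code. The `iata_code` column name, the sample rows, and both function names are assumptions for illustration, not taken from the bot's actual data file.

```python
import csv
import io

# Illustrative rows only; the real file and its columns may differ.
SAMPLE_CSV = """ident,type,name,iata_code
KLAX,large_airport,Los Angeles International Airport,LAX
KXYZ,medium_airport,Example Regional Airport,XYZ
"""


def large_airports(csv_text):
    """Collect airport codes whose "type" field is "large_airport"."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return {
        row["iata_code"]
        for row in reader
        if row["type"] == "large_airport" and row["iata_code"]
    }


def delay_prefix(airport_code, large):
    """Return the extra wording for large airports, else an empty string."""
    return "Large airport delay! " if airport_code in large else ""


large = large_airports(SAMPLE_CSV)
print(delay_prefix("LAX", large) + "LAX is reporting delays.")
```

Building the set once at startup keeps the per-post check to a single membership test.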