epeeme / web

Web site that presents results & ranking data from LPJS, Elite Epee, BYC, EYC and other UK youth, cadet & junior fencing competitions.
https://epee.me/

EFC historical results (pre-2010) #27

Open epeeme opened 4 years ago

epeeme commented 4 years ago

Start tracking down and importing data from 2005 to 2010, based on Rodney's archive of which cadet events were selected. Non-selected events should also be found, but that will need a list of all seasons compiled somehow, possibly from the Wayback Machine archives of the EFC site.

epeeme commented 4 years ago

Bonn 2009 https://static-fencing-eu.s3-eu-west-1.amazonaws.com/uploads/attachments/091017-18_Bonn_EC_VH_RES.pdf

epeeme commented 4 years ago

Not EFC, but missing BRC from 2009 - http://britishfencing.com/uploads/files/2743_welshcadetbscepee2009opencadetmensepee.html

epeeme commented 4 years ago

https://web.archive.org/web/20100211132544/http://www.eurofencing.info/EfcEng/defaultE.htm

epeeme commented 4 years ago

https://web.archive.org/web/20080821194538fw_/http://www.eurofencing.info/EfcEng/Results/ECJ2005Tapolca/Epee_Masculin/EBJUN_FPBT_CLAS.htm

epeeme commented 4 years ago

https://web.archive.org/web/20080603035200/http://www.kff2006wcjc.com/Result/Result.html

epeeme commented 4 years ago

https://web.archive.org/web/20070726225351/http://www.fpe.pt/esgrima2004/

epeeme commented 4 years ago

https://web.archive.org/web/20070726230701/http://www.fencing.home.pl/WynikiENG.html

epeeme commented 4 years ago

Explore this - http://web.archive.org/web/20070213015150/http://www.fencingworldwide.com:8080/WAFechten/index.jsp?alle=ja

epeeme commented 4 years ago

http://web.archive.org/web/20050307042620/http://www.copenhagen2004.dk/sw176.asp

epeeme commented 4 years ago

https://www.eurofencing.info/competitions/championships/case:competitions/tournamentId:62

epeeme commented 4 years ago

http://www.mask-sport.com/ now has some historical data

epeeme commented 4 years ago

The old FIE site - https://web.archive.org/web/20080511115006/http://www.fie.ch/

epeeme commented 4 years ago

OK, so the key to success seems to be in extracting all of these https://web.archive.org/web/20100420014610/http://fie.ch/Competitions/ResultsList.aspx?Key=503D59A8157B2687B8C4CE2CAB517168 links with the different key values. They are all the JWC and EFC events that are no longer on the current official FIE site.

epeeme commented 4 years ago

So here it is, the method for extracting all the epee results from the old FIE site :-)

The JSON data can be pulled from here - this is the call archive.org uses to generate its datatables output.

https://web.archive.org/web/timemap/?url=http%3A%2F%2Fwww.fie.ch%2F&matchType=prefix&collapse=urlkey&output=json&fl=original%2Cmimetype%2Ctimestamp%2Cendtimestamp%2Cgroupcount%2Cuniqcount&filter=!statuscode%3A%5B45%5D..&limit=100000&_=1579775059176

From this I can manually extract just the URLs we are interested in, which in this case are those that contain: Competitions/ResultsList.aspx?

https://pastebin.com/cqDUJfDQ
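The manual filtering could probably be scripted. A rough Python sketch, assuming the timemap response is a JSON array of rows whose first row is the header of field names (matching the fl= list in the call above). The function name results_list_urls and the MARKER constant are made up for illustration, and the header assumption should be checked against a real response.

```python
import json

# Substring that identifies the result-list pages we care about.
MARKER = "Competitions/ResultsList.aspx?"


def results_list_urls(timemap_json: str) -> list:
    """Return the unique 'original' URLs containing MARKER, in input order.

    Assumption: the payload is a list of rows and the first row is the
    header naming each column (e.g. "original", "mimetype", ...).
    """
    rows = json.loads(timemap_json)
    if not rows:
        return []
    header, *data = rows
    col = header.index("original")  # column holding the archived page URL
    seen = set()
    urls = []
    for row in data:
        url = row[col]
        if MARKER in url and url not in seen:
            seen.add(url)
            urls.append(url)
    return urls
```

Feeding it the saved timemap output should give roughly the same list as the pastebin above, one URL per Key value.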

Having narrowed that down, I can use the Wayback Availability JSON API to make a call which will give us an exact URL to call

http://archive.org/wayback/available?url=www.fie.ch/Competitions/ResultsList.aspx?Key=007DB732C50F5AC4203A39A208FD4BC

{
    "archived_snapshots": {
        "closest": {
            "available": true,
            "url": "http://web.archive.org/web/20140527215548/http://www.fie.ch/Competitions/ResultsList.aspx?Key=007DB732C50F5AC4203A39A208FD4BC",
            "timestamp": "20140527215548",
            "status": "200"
        }
    },
    "url": "www.fie.ch/Competitions/ResultsList.aspx?Key=007DB732C50F5AC4203A39A208FD4BC"
}
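For doing this over the whole list, something like the Python sketch below might work. It only relies on the endpoint and the response shape shown above; the helper names (availability_query, closest_snapshot_url, fetch_availability) are hypothetical, and there is no retry or rate limiting, which a real run over hundreds of keys would probably want.

```python
import json
import urllib.parse
import urllib.request
from typing import Optional

API = "http://archive.org/wayback/available"


def availability_query(page_url: str) -> str:
    """Build the availability API URL, percent-encoding the target page URL."""
    return API + "?" + urllib.parse.urlencode({"url": page_url})


def closest_snapshot_url(response: dict) -> Optional[str]:
    """Pick archived_snapshots.closest.url, or None if nothing usable is archived."""
    closest = response.get("archived_snapshots", {}).get("closest")
    if not closest or not closest.get("available"):
        return None
    return closest.get("url")


def fetch_availability(page_url: str) -> dict:
    """Call the API over the network and decode the JSON reply."""
    with urllib.request.urlopen(availability_query(page_url), timeout=30) as resp:
        return json.load(resp)
```

Usage would be along the lines of closest_snapshot_url(fetch_availability(url)) for each URL from the filtered list, skipping any that come back as None.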

Using archived_snapshots.closest.url I can pull the source, parse the code for <span id="labWeaponDat" class="ListText">Epee</span>, which will identify if that URL is of interest, and then extract all the remaining data from the Table4 block.
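A minimal Python sketch of that last parsing step, using only the standard library. It assumes "Table4" is the id of a plain HTML table with no nested tables inside it, which I have not verified against every archived page. The class name ResultsPageParser and the function epee_rows are made up for illustration, and the column layout of the rows is left uninterpreted.

```python
from html.parser import HTMLParser


class ResultsPageParser(HTMLParser):
    """Collect the labWeaponDat text and the cell text of every row in Table4."""

    def __init__(self):
        super().__init__()
        self.weapon = ""
        self.rows = []
        self._in_weapon = False
        self._in_table = False  # assumption: Table4 contains no nested tables
        self._row = None
        self._cell = None

    def _flush_cell(self):
        # Close the current cell, collapsing runs of whitespace.
        if self._cell is not None and self._row is not None:
            self._row.append(" ".join("".join(self._cell).split()))
        self._cell = None

    def _flush_row(self):
        self._flush_cell()
        if self._row:
            self.rows.append(self._row)
        self._row = None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "span" and attrs.get("id") == "labWeaponDat":
            self._in_weapon = True
        elif tag == "table" and attrs.get("id") == "Table4":
            self._in_table = True
        elif self._in_table and tag == "tr":
            self._flush_row()  # tolerate a missing </tr>
            self._row = []
        elif self._in_table and tag in ("td", "th") and self._row is not None:
            self._flush_cell()  # tolerate a missing </td>
            self._cell = []

    def handle_endtag(self, tag):
        if tag == "span":
            self._in_weapon = False
        elif tag in ("td", "th"):
            self._flush_cell()
        elif tag == "tr":
            self._flush_row()
        elif tag == "table" and self._in_table:
            self._flush_row()
            self._in_table = False

    def handle_data(self, data):
        if self._in_weapon:
            self.weapon += data
        if self._cell is not None:
            self._cell.append(data)


def epee_rows(html: str):
    """Return the Table4 rows if the page is an epee event, otherwise None."""
    parser = ResultsPageParser()
    parser.feed(html)
    parser.close()
    return parser.rows if parser.weapon.strip() == "Epee" else None
```

Pages for other weapons come back as None and can be skipped. For epee pages the result is a list of rows, each a list of cell strings, ready to be mapped onto whatever import format is needed.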