cavis closed this issue 5 years ago
?url=<any-string>, for example together with the key:
/i.gif?key=msn&url=https://www.pri.org/stories/2019-04-08/elections-loom-indias-modi-vows-end-terrorism-kashmir-more-military-force
And if the URL needs to be encoded and/or relative, that works for me.
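For reference, a minimal sketch of what an encoded version of that example could look like. The helper name `pixel_url` is made up for illustration, and the param names `key` and `url` are simply the ones from the example above, not a settled API.

```python
from urllib.parse import parse_qs, urlencode, urlsplit


def pixel_url(key: str, url: str, base: str = "/i.gif") -> str:
    """Build a tracking-pixel URL with percent-encoded query values.

    `pixel_url` and its defaults are hypothetical, for illustration only.
    """
    return f"{base}?{urlencode({'key': key, 'url': url})}"


story = (
    "https://www.pri.org/stories/2019-04-08/"
    "elections-loom-indias-modi-vows-end-terrorism-kashmir-more-military-force"
)
tracked = pixel_url("msn", story)

# Decoding the query string gives back the original, unencoded values.
params = parse_qs(urlsplit(tracked).query)

# A relative canonical url goes through the same way.
relative = pixel_url("msn", "/stories/2019-04-08/some-story")
```

Encoding the value keeps any `?` or `&` inside the canonical url from being read as part of the pixel's own query string.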
So the ?url is the canonical url then (not the url it was accessed under, which is the "referer" header).
What is key=msn representing in your example? Since I'm already confused, that probably indicates we need some better query-param / bigquery-field names here.
Yes, url is the canonical url.
For the key, I was just thinking that it would simply identify the source feed, because the canonical url could appear in multiple feeds.
Okay, plan is that this will track 6 things to bigquery:
- ?k= key query param (any string)
- ?c= canonical-url query param (any string)
- user-agent header
- x-forwarded-for header (or fall back to the source-ip of the request)
- referer header
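A rough sketch of how the fields listed above might be pulled out of a request, just to make the fallback concrete. The function name, the row field names, and the choice to take the first entry of a comma-separated x-forwarded-for value are all assumptions here, not something decided in this thread.

```python
from urllib.parse import parse_qs


def extract_fields(query: str, headers: dict, source_ip: str) -> dict:
    """Collect the tracked values from one request (hypothetical helper)."""
    params = parse_qs(query)
    # Header names are case-insensitive, so normalize them before lookup.
    lowered = {name.lower(): value for name, value in headers.items()}

    # Assumption: when x-forwarded-for holds a comma-separated chain, use the
    # first entry. With no header at all, fall back to the request's source ip.
    forwarded = lowered.get("x-forwarded-for", "")
    client_ip = forwarded.split(",")[0].strip() or source_ip

    return {
        "key": params.get("k", [None])[0],        # ?k=
        "canonical": params.get("c", [None])[0],  # ?c=
        "user_agent": lowered.get("user-agent"),
        "ip": client_ip,
        "referer": lowered.get("referer"),
    }


row = extract_fields(
    "k=msn&c=%2Fstories%2Fsome-story",
    {"User-Agent": "test-agent", "Referer": "https://example.com/page"},
    "203.0.113.7",
)
```

Missing params or headers come back as `None` rather than raising, so a bare request would still produce a row.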
Start out with something similar to proxy.prx.org.
- /i.gif path
- ?key=<any-string> can be any string, and will end up under the key column in BigQuery
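To illustrate the shape of this, here is a small sketch of a handler for the /i.gif path. It is not how proxy.prx.org works; the `handle` function, the 404 for other paths, and the `Cache-Control` header are assumptions, and the returned row stands in for whatever would actually be written to BigQuery.

```python
import base64
from urllib.parse import parse_qs

# A commonly used 1x1 transparent GIF, base64-encoded.
PIXEL = base64.b64decode(
    "R0lGODlhAQABAIAAAAAAAP///yH5BAEAAAAALAAAAAABAAEAAAIBRAA7"
)


def handle(path: str, query: str):
    """Return (status, headers, body, row) for one request.

    `row` is None when nothing should be logged. Hypothetical helper.
    """
    if path != "/i.gif":
        # Assumption: anything other than the pixel path is not tracked.
        return 404, {}, b"", None

    # Any string is accepted for ?key= and lands in the "key" field.
    key = parse_qs(query).get("key", [None])[0]
    headers = {
        "Content-Type": "image/gif",
        # Assumption: discourage caching so each view reaches the server.
        "Cache-Control": "no-store",
    }
    return 200, headers, PIXEL, {"key": key}


status, headers, body, row = handle("/i.gif", "key=msn")
```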