TheStanfordDaily / archives-text

Archives text for the Stanford Daily since 1892. Help us improve by submitting a pull request!
https://archives.stanforddaily.com/
0 stars 0 forks source link

Search strategy #1

Open epicfaace opened 5 years ago

epicfaace commented 5 years ago

File structure

Each file looks like the following:

# Article title
## Article subtitle
### Author name
content content content
content content content

File path

Each file is stored in a file path /yyxx/yyyx/yyyy/mm/dd/[title].[type].txt For example, /19xx/198x/1985/12/02/MODSMD_ARTICLE4.article.txt

Bitbucket queries

Query this URL: https://api.bitbucket.org/2.0/teams/%7B34f1ce51-8a37-4436-9c9c-dc71cb82f5d6%7D/search/code?search_query=path:1894%20leland%20stanford

Sample queries:

Query text Filters Query
leland 1900-1999 path:/19xx leland
leland 1990-1999 path:/199x leland
leland 1999 path:/19xx/199x/1999 leland
leland 1990-1998 path:/19xx/199x NOT path:/199x/199x/1998 leland