issues
search
medialab
/
hyphe
Websites crawler with built-in exploration and control web interface
http://hyphe.medialab.sciences-po.fr/demo/
GNU Affero General Public License v3.0
329
stars
59
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Update README.md
#500
bjornekstrom
closed
2 months ago
3
Investigate why some subdomains are not automatically set as such when using the related WECR
#499
boogheta
opened
4 months ago
0
download internal hyperlinks
#498
SeyedAlirezaMalih
opened
6 months ago
9
Pending for more URLS
#497
SeyedAlirezaMalih
closed
6 months ago
3
different results for same search, two years later
#496
sofiatipa
opened
11 months ago
7
[Front] Add an OK button to tags interface after inputting a value
#495
boogheta
closed
1 year ago
1
[Front] Crawl button should be blue after defining WEs ?
#494
boogheta
closed
1 year ago
0
The demo version won't crawl?
#493
Bseis
closed
1 year ago
3
Differentiate redirections/errors in crawl's counts
#492
boogheta
opened
1 year ago
0
Crawler pending
#491
Bseis
closed
1 year ago
2
Allow to setup scrapyd debug as docker env var
#490
boogheta
closed
1 year ago
0
Setup new WECR for youtube users
#489
boogheta
closed
1 year ago
1
Intégrer les fontes dans le code plutôt que sur google
#488
boogheta
closed
1 year ago
0
Complete initial OVERVIEW page with content such as HyBro's new tab content
#487
boogheta
opened
1 year ago
0
Advertise Hyphe release version or commit from API/Front
#486
boogheta
closed
1 year ago
0
Reenable to select WE in LIST by clicking NAME column at least
#485
boogheta
closed
1 year ago
0
Better links archives INA
#484
boogheta
closed
1 year ago
2
Better indicate loading pages activity when downloading CSV
#483
boogheta
closed
1 year ago
0
[NETWORK] Improve hovering legend
#482
boogheta
closed
1 year ago
0
[OVERVIEW] change the behaviour of the indexation advancement plot when index lags behind a lot
#481
boogheta
opened
1 year ago
0
[EXPORT] allow a Sitography format
#480
boogheta
opened
1 year ago
1
disable lookups on archives, or (better) actually use the archive to resolve redirections
#479
boogheta
opened
1 year ago
0
allow to mark a crawl as "processed" in the crawls list
#478
boogheta
opened
1 year ago
0
Fix ego network blinking
#477
boogheta
closed
1 year ago
0
Check values of new fields in Crawl metadata exports
#476
boogheta
closed
1 year ago
1
Cleanup traph directories after corpus destruction
#475
boogheta
closed
1 year ago
0
Add help on what are suspicious crawls
#474
boogheta
closed
1 year ago
0
[Text indexation] Unicode errors sometimes when indexing into ES
#473
boogheta
closed
1 year ago
0
Add ability to remove a web entity
#472
slauriere
closed
1 year ago
2
detection permalink web.archive not working with http version of web.archive
#471
boogheta
closed
1 year ago
0
Bug with network when tag category named type ?
#470
boogheta
closed
1 year ago
0
Link to web.archive url of a crawled page within a webentity page is not clickable
#469
boogheta
closed
1 year ago
0
Disable Suspicious crawl status on webentity crawled at the page level
#468
boogheta
closed
1 year ago
0
Bugs with advanced crawl options
#467
boogheta
closed
1 year ago
0
add actions pending warner on other pages such as StartCrawls
#466
boogheta
closed
1 year ago
0
Actions pending alert remains active after changing page
#465
boogheta
closed
1 year ago
0
Various Ideas from RESPADON Sprint
#464
boogheta
closed
1 year ago
0
Use ural's shorteners list
#463
boogheta
opened
2 years ago
0
"Node has no left sibling" when calling `paginate_webentity_pagelinks`
#462
dale-wahl
opened
2 years ago
6
Force specific User Agent per crawl
#461
paulgirard
closed
1 year ago
0
[crawls] Give exact datetimes on hover on human ones (such as 4 months ago)
#460
boogheta
closed
1 year ago
0
[network] allow to view selected tag when hovering in legend
#459
boogheta
closed
1 year ago
0
Enable/Disable multivalued tags
#458
boogheta
opened
2 years ago
0
Rename a corpus
#457
boogheta
opened
2 years ago
0
Allow to set and filter tags from listwebentities
#455
boogheta
opened
2 years ago
0
Add a tool to cleanup/merge entities
#454
boogheta
opened
2 years ago
0
Renew code to load recent user agents
#453
boogheta
closed
1 year ago
1
Give access from hyphe frontend to scrapyd's crawl logs
#452
boogheta
closed
1 year ago
0
Configuration of desktop version in macOS (12.3)
#451
Sofiatypa
closed
2 years ago
2
IMPORT add the option to load existing hyphe data/metas
#450
boogheta
opened
2 years ago
0
Next