Closed tschaffter closed 1 year ago
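// siteUrl and sitemapFilepath are the two positional arguments passed on the command line.
const generator = SitemapGenerator(siteUrl, {
  filepath: sitemapFilepath,   // where the generated sitemap is written
  maxDepth: 2,                 // crawl at most two link levels from siteUrl
  maxEntriesPerFile: 50000,    // maximum number of URLs per sitemap file
  stripQuerystring: false      // keep query strings on crawled URLs
});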
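The commands in this issue pass the site URL and the output path as two positional arguments. The script itself is not shown here, so the following is only a minimal sketch of how such arguments might be read and validated; `parseArgs` is a hypothetical helper, not code taken from `tools/generate-sitemap.js`.

```javascript
// Hypothetical helper: the real tools/generate-sitemap.js is not shown in this issue.
function parseArgs(argv) {
  // argv mirrors process.argv: [node, script, siteUrl, sitemapFilepath]
  const [siteUrl, sitemapFilepath] = argv.slice(2);
  if (!siteUrl || !sitemapFilepath) {
    throw new Error(
      'Usage: node tools/generate-sitemap.js <siteUrl> <sitemapFilepath>'
    );
  }
  // new URL() throws a TypeError on a malformed URL, so a typo fails
  // before any crawl starts.
  new URL(siteUrl);
  return { siteUrl, sitemapFilepath };
}

// Possible use inside the script:
//   const { siteUrl, sitemapFilepath } = parseArgs(process.argv);
```

Validating the URL up front would separate "bad input" from "the crawler found nothing", which matters here because several sites legitimately return a single page.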
node tools/generate-sitemap.js http://www.sagebionetworks.org sitemap-sagebionetworks.xml
Number of `<url>`: 29
node tools/generate-sitemap.js http://www.synapse.org sitemap-synapse.xml
Number of `<url>`: 1
node tools/generate-sitemap.js https://agora.adknowledgeportal.org sitemap-agora.xml
Number of `<url>`: 1
node tools/generate-sitemap.js https://adknowledgeportal.synapse.org sitemap-adknowledgeportal.xml
Number of `<url>`: 1
node tools/generate-sitemap.js https://nf.synapse.org sitemap-nf.xml
Number of `<url>`: 1
Note: https://csbc-pson.synapse.org redirects to https://cancercomplexity.synapse.org/.
node tools/generate-sitemap.js https://cancercomplexity.synapse.org sitemap-cancercomplexity.xml
Warning: The crawler does not save the sitemap file when using https://csbc-pson.synapse.org (because of the redirection?).
Number of `<url>`: 1
node tools/generate-sitemap.js https://www.cri-iatlas.org sitemap-iatlas.xml
Number of `<url>`: 6
node tools/generate-sitemap.js https://isb-cgc.shinyapps.io/iatlas sitemap-iatlas-shiny.xml
Warning: The crawler does not save the sitemap file (here there is no redirection).
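The `<url>` counts reported above can be reproduced from a generated file. The issue does not say how they were obtained, so this is just one possible sketch; `countUrlEntries` is a hypothetical helper that counts opening `<url>` tags in the sitemap text.

```javascript
// Hypothetical helper: count the <url> entries in a sitemap given as a string.
function countUrlEntries(xml) {
  // Match "<url" followed by whitespace or ">", so that the enclosing
  // <urlset> element is not counted.
  const matches = xml.match(/<url[\s>]/g);
  return matches ? matches.length : 0;
}

// Possible use on a generated file:
//   const fs = require('fs');
//   const xml = fs.readFileSync('sitemap-sagebionetworks.xml', 'utf8');
//   console.log(`Number of <url>: ${countUrlEntries(xml)}`);
```

A regular expression is enough for a quick check like this; a full XML parser would be the safer choice if the files could contain comments or CDATA sections that mention `<url>`.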
The crawler did not save a `sitemap.xml` file for https://csbc-pson.synapse.org but worked fine when targeting the redirected URL, https://cancercomplexity.synapse.org.
The crawler did not save a `sitemap.xml` file for https://isb-cgc.shinyapps.io/iatlas. It seems that the generator fails to generate a sitemap for Shiny apps, as the same result was observed for other Shiny apps.
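One way to work around the redirect problem would be to resolve the final URL before starting the crawl. The sketch below assumes the redirect is a plain HTTP 3xx response with a `Location` header, which this issue does not confirm; `redirectTarget` is a hypothetical helper that only interprets a response that has already been received.

```javascript
// Hypothetical helper: given the requested URL, the response status code,
// and the response headers (lower-cased names, as Node's http module
// provides them), return the absolute redirect target or null.
function redirectTarget(requestedUrl, statusCode, headers) {
  const isRedirect = statusCode >= 300 && statusCode < 400;
  if (!isRedirect || !headers.location) {
    return null;
  }
  // Location may be relative, so resolve it against the requested URL.
  return new URL(headers.location, requestedUrl).href;
}
```

If the helper returns a URL, the crawl could be started from that URL instead of the original one, which matches the manual workaround of targeting https://cancercomplexity.synapse.org directly.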
Valid on 2022/08/04.
Site | Pages found | SSR enabled | `sitemap.xml` | Technology | Contact |
---|---|---|---|---|---|
https://synapse.org | 1 | ❌ | ❌ | Google Web Toolkit | Jay Hodgson |
https://sagebionetworks.org | 29 | ✅ | ✅ | WordPress | |
https://www.cri-iatlas.org | 6 | ✅ | ✅ | WordPress | |
https://challenge-registry.org | 3 | ✅ | ✅ | Angular | Thomas Schaffter |
https://agora.adknowledgeportal.org | 1 | ❌ | ❌ | Angular | Anna Greenwood |
https://adknowledgeportal.synapse.org | 1 | ❌ | ✅ | React | Jay Hodgson |
https://nf.synapse.org | 1 | ❌ | ✅ (empty) | React | Jay Hodgson |
https://cancercomplexity.synapse.org | 1 | ❌ | ✅ | React | Jay Hodgson |
https://isb-cgc.shinyapps.io/iatlas | NA* | ✅ | ❌ | Shiny | Andrew Lamb |
*The crawler used seems to be unable to crawl Shiny apps.
Generate the `sitemap.xml` file automatically, e.g. using the script `tools/generate-sitemap.js` provided by this project.
The idea is to crawl these sites to identify whether they are optimized for SEO. We will use the crawler recently added to this repo.
Main:
Portals:
Shiny apps: