rustwasm / book

The Rust and WebAssembly Book
https://rustwasm.github.io/docs/book/
MIT License
1.73k stars 208 forks source link

First Google result points to "unpublished" book, which is identical to published one #189

Open kornelski opened 4 years ago

kornelski commented 4 years ago

To Reproduce Steps to reproduce the behavior:

  1. Google "rust wasm"
  2. Click on the first result — https://rustwasm.github.io/book/
  3. The page says in red "This is unpublished documentation of working with Rust and WebAssembly, the published documentation is available on the main Rust and WebAssembly documentation site . Features documented here may not be available in released versions of tooling for Rust and WebAssembly. "
  4. Clicking the " on the main Rust and WebAssembly documentation site " link takes to a seemingly identical page, on the same site, just without this warning.

Expected behavior I would like only relevant and accurate content to be hosted and promoted, not a mix of near-duplicate dead pages competing with real ones in search results.

Additional context If a page is outdated, just delete it. Use robots.txt to remove dead URLs from search engines.

kornelski commented 4 years ago

Ping. This is still broken.

Screenshot 2019-12-19 at 11 04 15
gabenodarse commented 4 years ago

It seems that this was done on purpose: https://github.com/rustwasm/book/commit/48d360e62cb57fb8c68c27653b96a72724745d0b

As you said, both pages appear to be identical. It's a simple thing to remove I just don't know if it's still serving a purpose.

kornelski commented 4 years ago

I'm not sure what's the intention here. It's confusing to publish something online, in search engines, and then call it unpublished.

If one of these copies is preferable, then the pages should have <link rel=canonical> pointing to the right one, or exclude less desirable copy from search engines using robots.txt or <meta name=robots content=noindex>.