HTTPArchive / almanac.httparchive.org

HTTP Archive's annual "State of the Web" report made by the web community
https://almanac.httparchive.org
Apache License 2.0
611 stars 170 forks source link

Compression 2021 #2160

Closed rviscomi closed 2 years ago

rviscomi commented 3 years ago

Part IV Chapter 22: Compression

Compression illustration

If you're interested in contributing to the Compression chapter of the 2021 Web Almanac, please reply to this issue and indicate which role or roles best fit your interest and availability: author, reviewer, analyst, and/or editor.

Content team

Lead Authors Reviewers Analysts Editors Coordinator
@lvandeve @lvandeve @jyrkialakuijala @paulcalvano @paulcalvano - - @rviscomi
Expand for more information about each role - The **[content team lead](https://github.com/HTTPArchive/almanac.httparchive.org/wiki/Content-Team-Leads'-Guide)** is the chapter owner and responsible for setting the scope of the chapter and managing contributors' day-to-day progress. - **[Authors](https://github.com/HTTPArchive/almanac.httparchive.org/wiki/Authors'-Guide)** are subject matter experts and lead the content direction for each chapter. Chapters typically have one or two authors. Authors are responsible for planning the outline of the chapter, analyzing stats and trends, and writing the annual report. - **[Reviewers](https://github.com/HTTPArchive/almanac.httparchive.org/wiki/Reviewers'-Guide)** are also subject matter experts and assist authors with technical reviews during the planning, analyzing, and writing phases. - **[Analysts](https://github.com/HTTPArchive/almanac.httparchive.org/wiki/Analysts'-Guide)** are responsible for researching the stats and trends used throughout the Almanac. Analysts work closely with authors and reviewers during the planning phase to give direction on the types of stats that are possible from the dataset, and during the analyzing/writing phases to ensure that the stats are used correctly. - **[Editors](https://github.com/HTTPArchive/almanac.httparchive.org/wiki/Editors'-Guide)** are technical writers who have a penchant for both technical and non-technical content correctness. Editors have a mastery of the English language and work closely with authors to help wordsmith content and ensure that everything fits together as a cohesive unit. - The **[section coordinator](https://github.com/HTTPArchive/almanac.httparchive.org/wiki/Section-Leads'-Guide)** is the overall owner for all chapters within a section like "User Experience" or "Page Content" and helps to keep each chapter on schedule. _Note: The time commitment for each role varies by the chapter's scope and complexity as well as the number of contributors._ For an overview of how the roles work together at each phase of the project, see the [Chapter Lifecycle](https://github.com/HTTPArchive/almanac.httparchive.org/wiki/Chapter-Lifecycle) doc.

Milestone checklist

0. Form the content team

1. Plan content

2. Gather data

3. Validate results

4. Draft content

5. Publication

Chapter resources

Refer to these 2021 Compression resources throughout the content creation process:

πŸ“„ Google Docs for outlining and drafting content πŸ” SQL files for committing the queries used during analysis πŸ“Š Google Sheets for saving the results of queries πŸ“ Markdown file for publishing content and managing public metadata

rviscomi commented 3 years ago

πŸ“Ÿ paging 2019/2020 contributors: @paulcalvano @yoavweiss @colinbendell @mo271 @jyrkialakuijala @sboukortt @veluca93 @AbbyTsai

Would any of you be interested to contribute to the 2021 chapter? I'd especially like to see more 2019/2020 authors become 2021 reviewers to help ease the transition and similarly I think prior reviewers would make great 2021 authors, being familiar with the process already. And prior analysts would make excellent 2021 analysts 😁

Or is there anyone new you'd like to see?

rviscomi commented 3 years ago

⚠️ Still looking for more contributors to make this chapter viable, specifically authors, reviewers, and analysts. If we're unable to staff the chapter by May 31 we'll have to close it and focus our efforts on other chapters.

jyrkialakuijala commented 3 years ago

I'd be happy to either review or author.

CΓ©dric Louvrier is doing very good work with his Pingo Image Optimization. He writes beautifully and with clarity on his forum posts. Consider including him into the process of creating the next Web Almanac if he has interest.

I hope that in next season Web Almanac will have an opportunity to cover JPEG XL and AVIF.

Probably some focus should go into how to deploy HDR and what is image quality and resolution in the internet.

rviscomi commented 3 years ago

@jyrkialakuijala sounds great, welcome back! Are you interested in being the content team lead for the chapter (responsible for overall content direction and staffing coauthors/reviewers as needed)?

paulcalvano commented 3 years ago

Awesome! Would love to contribute on this chapter as well. I'm happy to be an analyst and reviewer. Looking forward to working with you @jyrkialakuijala !

rviscomi commented 3 years ago

Thanks Paul! I've added you to the reviewer/analyst roles in the chapter metadata at the top. @jyrkialakuijala can you confirm if you're open to taking on the content team lead role for this chapter? You'd effectively be the lead author.

jyrkialakuijala commented 3 years ago

Thank you! I will happily take the content team lead role.

rviscomi commented 3 years ago

@jyrkialakuijala thanks for your interest in authoring this chapter! As the content team lead, you'll be responsible for the scope and direction of the chapter and keeping it on schedule. We automatically monitor the staffing and progress of each chapter based on the state of the initial comment so please keep that updated as you add new contributors and meet each milestone. First, you'll need to go to https://github.com/HTTPArchive/ to accept the invitation to join the GitHub team. That will give you the necessary permissions to edit this issue.

We've created a Google Doc for this chapter, which you're encouraged to use to collaborate with the content team on the initial outline, metrics, and ultimately the final draft.

Next steps for this chapter are:

There's not currently a section coordinator for this chapter, so I'll be periodically checking in with you directly to make sure the chapter is staying on schedule. Reach out here in this issue if you have any questions about the process.

More information about the content team lead and author roles and responsibilities are available for reference in the wiki if needed.

To anyone else still interested in contributing to this chapter, please comment below to join the team!

rviscomi commented 3 years ago

Hey @jyrkialakuijala, I know we've been emailing but could you give a status update on the outline in the doc? Do you expect you'll be able to get it done by the end of the week?

rviscomi commented 3 years ago

⚠️ @jyrkialakuijala any updates on the chapter outline?

lvandeve commented 3 years ago

Hi, I'm one of Jyrki's reports and would like to contribute.

The outline is the same as last year (with new text but comparable data), with the addition of information about compression levels, and I added in the doc under TOC under outline

For the metrics: it would be interesting to gather data about compression levels used (gzip level 1 to 9 or more optimized such as zopfli, brotli level 1 to 11). These levels are not directly stored in the data (but could be estimated with various methods), and I'd like to check: does it sound feasible to get a proxy for this from the data?

rviscomi commented 3 years ago

Welcome @lvandeve! I'll defer to @paulcalvano on the feasibility of the metrics, but I look forward to seeing the new data.

lvandeve commented 3 years ago

I'm going to drive the authoring side of the Compression 2021 chapter. I propose the following division for analysis:

@paulcalvano to run the same analysis as last year

@lvandeve with the help of other authors to run the new analysis on compression levels

rviscomi commented 3 years ago

πŸ‘‹ Hi @lvandeve @jyrkialakuijala @paulcalvano, just checking in on the chapter progress. How is the analysis coming along?

rviscomi commented 3 years ago

@lvandeve @paulcalvano I see that the results sheet has a lot of data already and Milestone 2 is checked off, but I don't see any queries added to this chapter's sql dir. Could you make sure that any queries used to generate results in the sheet are backed up to the repo?

Besides that, any other blockers or is the analysis generally complete?

rviscomi commented 3 years ago

@lvandeve @paulcalvano ping on my previous comment, any update on getting the PR started to check in the SQL?

paulcalvano commented 3 years ago

Hey @rviscomi . The analysis is complete, and I'll submit a PR with the queries shortly. Apologies for the delay.

rviscomi commented 2 years ago

@lvandeveΒ @jyrkialakuijala @paulcalvano

πŸŽ‰ This chapter is fully written, reviewed, edited, and ready to be launched on Wednesday! Thank you to all of the contributors who put in the time and effort to make this a great chapter.

When you get 5 minutes, I'd really appreciate if you could fill out our contributor survey to tell us (the project leads) about your experience. It's super helpful to hear what went well or what could be improved for next time. πŸ™

Congratulations and thank you all again. I'm excited for this to launch soon!