tidyverse / haven

Read SPSS, Stata and SAS files from R
https://haven.tidyverse.org
Other
423 stars 115 forks source link

Release haven 2.5.0 #674

Closed gorcha closed 2 years ago

gorcha commented 2 years ago

Prepare for release:

Submit to CRAN:

Wait for CRAN...

gorcha commented 2 years ago

Over to you, @hadley

hadley commented 2 years ago

@gorcha revdepchecks look good. Would you mind suggesting a few new features that I should highlight in the blog post? I normally just skim the NEWS.md looking for the bullets that I think are likely to affect a decent number of people.

gorcha commented 2 years ago

Sure!

In general this release is focused on improvements and bug fixes for file writing.

Noteworthy new features are:

hadley commented 2 years ago

Do I remember correctly that the support for strL also considerably decreases file sizes?

gorcha commented 2 years ago

It can do in some cases with smaller character vectors but not universally (only noticeably where there's a lot of duplication). By far the biggest impact of adding strL support is that we can now write strings with more than 2045 characters.

One change that has a big Stata file size improvement is setting the minimum string variable width to 1 instead of 0 - ReadStat used str2045 for 0 width string variables previously, but these now use str1. Since str# uses a fixed amount of bytes per record this saves a huge amount of space in files with blank string variables.

hadley commented 2 years ago

Ah got it, thanks!