Open traversc opened 1 month ago
Thanks @traversc for this note. Your decision is very respectable, although I would urge to keep qs
on CRAN as long as possible. Many packages have not, or cannot, move into full API compliance. Major packages including data.table would have great difficulties to do that. I don't see CRAN maintainers starting to crack down on non-complient packages, especially packages that are highly depended upon such as qs
.
Thanks @SebKrantz , for now CRAN isn't forcing the issue. I hope data.table gets the official support they need.
@traversc perhaps one more note here, the CRAN policy suggests that only major x.y.0 updates may be forced to fix all issues. So it should be possible to keep .qs
going with minor updates.
@SebKrantz Are you referring to this part?
Maintainers will be asked to update packages which show any warnings or significant notes, especially at around the time of a new x.y.0 release. Packages which are not updated are liable to be archived.
R 4.5 is scheduled for Spring which is hopefully enough time to gracefully deprecate everything.
I plan to deprecate the
qs
package in the future. There is a replacement available,qs2
, on CRAN and GitHub (1).There are two reasons:
New CRAN enforcement regarding certain internal functions that qs relies on (2). Since qs handles serialization for both data and internal objects, maintaining proper serialization has become difficult without broader access to these now-restricted functions.
qs
was first released in 2019. Since then, there have been numerous changes/improvements to the internals of R and therefore its serialization of internal objects. As a result, R updates have sometimes causedqs
to break in unexpected ways. Those breaks obviously cause disruption and have been time consuming to fix.The new
qs2
package addresses these issues.It uses only approved API functions and is designed to be more future-proof. The package has two new formats:
The
qs2
format uses R's built-in serialization but improves upon it with better file I/O, zstd compression, byte shuffling and multithreading. This is a good 80/20 solution and doesn`t require any update to ensure it works in the future.The
qdata
format, a spiritual successor toqs
, features its own serialization for data only (vectors, data frames, lists, matrices, attributes). It outperformsqs
andqs2
formats, especially with multithreading (3) and I also plan for limited cross-compatibility with Python later on.Thanks to everyone who used the
qs
package over the years and I hopeqs2
will be a worthy successor!(3) Benchmarks (4.5 GB mixed numeric/text data)
Single-threaded
Multi-threaded (8 threads)