JuliaStats / RDatasets.jl

Julia package for loading many of the data sets available in R
GNU General Public License v3.0

update to julia-0.7/1.0 #54

Closed alyst closed 6 years ago

alyst commented 6 years ago

This PR requires JuliaData/RData.jl#44 to be merged and the minimum required RData version to be bumped.

andreasnoack commented 6 years ago

@alyst Could you please push a version with an updated RData requirement?

alyst commented 6 years ago

@andreasnoack I'm currently testing it locally, then I will push the changes. BTW, do you know the fate of johnmyleswhite/MNIST.jl? It hasn't been touched for years, and PRs are ignored. It doesn't have much functionality, but it contains a medium-sized MNIST dataset that is used for testing/benchmarking TSne.jl. Copying this data to TSne.jl would make it too heavy. Maybe there's another Julia datasets package that already contains it? Or, if somebody can merge the PR, I can update MNIST.jl to 0.7/1.0. Or the data could be moved to RDatasets (but we might not want to feed it into a mammoth).

andreasnoack commented 6 years ago

MNIST is in https://github.com/JuliaML/MLDatasets.jl, which is actively maintained, so I'd recommend using that version.

alyst commented 6 years ago

Ah, good to know. Will switch to it. Maybe somebody could update MNIST.jl/README.md to redirect people to MLDatasets.jl? It's still in the registry, so people will keep discovering it.

alyst commented 6 years ago

It seems there's a considerable CSV-related memory leak. When testing locally, my 24 GB of RAM is not enough. When I skip CSV datasets, everything is much faster and memory usage is normal.

alyst commented 6 years ago

@andreasnoack FYI johnmyleswhite/MNIST.jl#30

andreasnoack commented 6 years ago

> It seems there's a considerable CSV-related memory leak

Could you elaborate?

alyst commented 6 years ago

> Could you elaborate?

While RDatasets tests were also running pretty slowly before (v0.6), I don't recall having memory issues. With this version, when testing locally, the dataset.jl test script didn't finish and memory usage was above 24 GB (I suspect the julia test process was killed, although there was no crash report). I don't think RDatasets.jl is responsible for the leak. When I skip .csv datasets (i.e. test .rda data only), everything runs fast. I don't know what the cause is. There's JuliaData/CSV.jl#236, which is worth investigating anyway, because the allocation and CPU overhead for a 2-row file is suspiciously high. However, I've tried reading that file 10000 times and I didn't observe increased memory usage. It might be something WeakRefStrings/InternedStrings related: unique strings from all datasets are cached (esp. during column type autodetection), but never freed.

I suspect the Julia 1.0 Travis build failed because of the same issue (the other builds suffered from network problems). The stacktrace is strange:

[ Info: Running dataset.jl...
ERROR: Package RDatasets errored during testing
Stacktrace:
 [1] pkgerror(::String, ::Vararg{String,N} where N) at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.0/Pkg/src/Types.jl:120
 [2] macro expansion at ./logging.jl:313 [inlined]
 [3] #test#61(::Bool, ::Function, ::Pkg.Types.Context, ::Array{Pkg.Types.PackageSpec,1}) at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.0/Pkg/src/Operations.jl:1283
 [4] #test at ./none:0 [inlined]
 [5] #test#44(::Bool, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::Function, ::Pkg.Types.Context, ::Array{Pkg.Types.PackageSpec,1}) at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.0/Pkg/src/API.jl:256
 [6] #test at ./none:0 [inlined]
 [7] #test#43 at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.0/Pkg/src/API.jl:239 [inlined]
 [8] #test at ./none:0 [inlined]
 [9] #test#42(::Base.Iterators.Pairs{Symbol,Bool,Tuple{Symbol},NamedTuple{(:coverage,),Tuple{Bool}}}, ::Function, ::Array{String,1}) at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.0/Pkg/src/API.jl:238
 [10] #test at ./none:0 [inlined]
 [11] #test#41 at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.0/Pkg/src/API.jl:237 [inlined]
 [12] (::getfield(Pkg.API, Symbol("#kw##test")))(::NamedTuple{(:coverage,),Tuple{Bool}}, ::typeof(Pkg.API.test), ::String) at ./none:0
 [13] top-level scope at none:0

XilinJia commented 6 years ago

I reported the CSV issue with a 2-row file, and I just posted an update there.

Speaking of the CSV memory leak, I have been battling for quite some time to track down the issue. I recently asked about it on the forum here. I can't pinpoint the leak, but it seems very likely to be related to CSV-related utilities.

alyst commented 6 years ago

This is what running ]test RDatasets looks like on my laptop until I kill it.

andreasnoack commented 6 years ago

@quinnj Do you have any idea what is happening here?

alyst commented 6 years ago

Some more detailed stats for the first few datasets (also a check that the tests are not just stuck in some loop):

[ Info: Running dataset.jl...
[ Info: COUNT::affairs (affairs.csv.gz): 601 rows, 18 columns, loaded in 0.825 s, 74549340 bytes allocated, 1401023 pool allocs, 7 malloc() calls
[ Info: COUNT::azdrg112 (azdrg112.csv.gz): 1798 rows, 4 columns, loaded in 0.271 s, 25136453 bytes allocated, 465146 pool allocs, 2 malloc() calls
[ Info: COUNT::azpro (azpro.csv.gz): 3589 rows, 6 columns, loaded in 0.343 s, 35851033 bytes allocated, 681613 pool allocs, 8 malloc() calls
[ Info: COUNT::badhealth (badhealth.csv.gz): 1127 rows, 3 columns, loaded in 0.256 s, 24009360 bytes allocated, 442758 pool allocs, 2 malloc() calls
[ Info: COUNT::fasttrakg (fasttrakg.rda): 15 rows, 9 columns, loaded in 1.669 s, 263158169 bytes allocated, 4971634 pool allocs, 10 malloc() calls
[ Info: COUNT::lbw (lbw.csv.gz): 189 rows, 10 columns, loaded in 0.361 s, 31625735 bytes allocated, 580346 pool allocs, 2 malloc() calls
[ Info: COUNT::lbwgrp (lbwgrp.csv.gz): 6 rows, 7 columns, loaded in 0.314 s, 27963002 bytes allocated, 511228 pool allocs, 2 malloc() calls
[ Info: COUNT::loomis (loomis.csv.gz): 410 rows, 11 columns, loaded in 0.366 s, 32880882 bytes allocated, 605149 pool allocs, 2 malloc() calls
[ Info: COUNT::mdvis (mdvis.csv.gz): 2227 rows, 13 columns, loaded in 0.384 s, 36350608 bytes allocated, 691317 pool allocs, 15 malloc() calls
[ Info: COUNT::medpar (medpar.csv.gz): 1495 rows, 10 columns, loaded in 0.243 s, 16189138 bytes allocated, 300746 pool allocs, 2 malloc() calls
[ Info: COUNT::rwm (rwm.csv.gz): 27326 rows, 4 columns, loaded in 0.221 s, 25244359 bytes allocated, 596697 pool allocs, 7 malloc() calls
[ Info: COUNT::rwm5yr (rwm5yr.rda): 19609 rows, 17 columns, loaded in 0.668 s, 128525118 bytes allocated, 2462780 pool allocs, 21 malloc() calls
[ Info: COUNT::ships (ships.csv.gz): 40 rows, 7 columns, loaded in 0.174 s, 13487416 bytes allocated, 241661 pool allocs, 4 malloc() calls
[ Info: COUNT::titanic (titanic.csv.gz): 1316 rows, 4 columns, loaded in 0.117 s, 9342787 bytes allocated, 172456 pool allocs, 2 malloc() calls
[ Info: COUNT::titanicgrp (titanicgrp.rda): 12 rows, 5 columns, loaded in 0.000 s, 123824 bytes allocated, 1011 pool allocs, 2 malloc() calls
[ Info: Ecdat::Accident (Accident.rda): 40 rows, 5 columns, loaded in 0.003 s, 188382 bytes allocated, 1982 pool allocs, 2 malloc() calls
[ Info: Ecdat::Airline (Airline.csv.gz): 90 rows, 6 columns, loaded in 0.217 s, 18973090 bytes allocated, 345288 pool allocs, 2 malloc() calls
[ Info: Ecdat::Airq (Airq.rda): 30 rows, 6 columns, loaded in 0.000 s, 81600 bytes allocated, 488 pool allocs, 2 malloc() calls
[ Info: Ecdat::Benefits (Benefits.rda): 4877 rows, 18 columns, loaded in 0.003 s, 1028912 bytes allocated, 16017 pool allocs, 21 malloc() calls
[ Info: Ecdat::Bids (Bids.csv.gz): 126 rows, 12 columns, loaded in 0.384 s, 34781742 bytes allocated, 634636 pool allocs, 2 malloc() calls
[ Info: Ecdat::BudgetFood (BudgetFood.rda): 23972 rows, 6 columns, loaded in 0.018 s, 3117200 bytes allocated, 82361 pool allocs, 10 malloc() calls
[ Info: Ecdat::BudgetItaly (BudgetItaly.csv.gz): 1729 rows, 11 columns, loaded in 0.385 s, 34113076 bytes allocated, 646368 pool allocs, 4 malloc() calls
[ Info: Ecdat::BudgetUK (BudgetUK.csv.gz): 1519 rows, 10 columns, loaded in 0.303 s, 25538993 bytes allocated, 480157 pool allocs, 2 malloc() calls
[ Info: Ecdat::Bwages (Bwages.rda): 1472 rows, 4 columns, loaded in 0.001 s, 214784 bytes allocated, 4791 pool allocs, 2 malloc() calls
[ Info: Ecdat::CPSch3 (CPSch3.rda): 11130 rows, 3 columns, loaded in 0.002 s, 1064544 bytes allocated, 33751 pool allocs, 6 malloc() calls
[ Info: Ecdat::Capm (Capm.csv.gz): 516 rows, 5 columns, loaded in 0.272 s, 19541042 bytes allocated, 356518 pool allocs, 2 malloc() calls
[ Info: Ecdat::Car (Car.rda): 4654 rows, 70 columns, loaded in 0.006 s, 2626528 bytes allocated, 16463 pool allocs, 73 malloc() calls
[ Info: Ecdat::Caschool (Caschool.rda): 420 rows, 17 columns, loaded in 0.138 s, 18197772 bytes allocated, 341979 pool allocs, 6 malloc() calls
[ Info: Ecdat::Catsup (Catsup.rda): 2798 rows, 14 columns, loaded in 0.001 s, 523216 bytes allocated, 8942 pool allocs, 7 malloc() calls
[ Info: Ecdat::Cigar (Cigar.csv.gz): 1380 rows, 9 columns, loaded in 0.350 s, 31654602 bytes allocated, 590241 pool allocs, 2 malloc() calls
[ Info: Ecdat::Cigarette (Cigarette.rda): 528 rows, 9 columns, loaded in 0.001 s, 162768 bytes allocated, 2213 pool allocs, 2 malloc() calls
[ Info: Ecdat::Clothing (Clothing.csv.gz): 400 rows, 13 columns, loaded in 0.303 s, 27308581 bytes allocated, 497716 pool allocs, 2 malloc() calls
[ Info: Ecdat::Computers (Computers.rda): 6259 rows, 10 columns, loaded in 0.002 s, 850336 bytes allocated, 19419 pool allocs, 13 malloc() calls
[ Info: Ecdat::Cracker (Cracker.rda): 3292 rows, 14 columns, loaded in 0.007 s, 595856 bytes allocated, 10427 pool allocs, 7 malloc() calls
[ Info: Ecdat::Crime (Crime.rda): 630 rows, 24 columns, loaded in 0.001 s, 262832 bytes allocated, 2596 pool allocs, 2 malloc() calls
[ Info: Ecdat::DM (DM.csv.gz): 778 rows, 4 columns, loaded in 0.190 s, 17315802 bytes allocated, 317269 pool allocs, 2 malloc() calls
[ Info: Ecdat::Diamond (Diamond.rda): 308 rows, 5 columns, loaded in 0.001 s, 131696 bytes allocated, 1497 pool allocs, 2 malloc() calls
[ Info: Ecdat::Doctor (Doctor.csv.gz): 485 rows, 4 columns, loaded in 0.125 s, 10578833 bytes allocated, 193112 pool allocs, 2 malloc() calls
[ Info: Ecdat::DoctorAUS (DoctorAUS.rda): 5190 rows, 15 columns, loaded in 0.002 s, 877488 bytes allocated, 16235 pool allocs, 18 malloc() calls
[ Info: Ecdat::DoctorContacts (DoctorContacts.rda): 20186 rows, 15 columns, loaded in 0.040 s, 4198478 bytes allocated, 73784 pool allocs, 24 malloc() calls
[ Info: Ecdat::Earnings (Earnings.rda): 4266 rows, 2 columns, loaded in 0.001 s, 430864 bytes allocated, 13145 pool allocs, 5 malloc() calls
[ Info: Ecdat::Electricity (Electricity.csv.gz): 158 rows, 8 columns, loaded in 0.318 s, 29648713 bytes allocated, 541556 pool allocs, 2 malloc() calls
[ Info: Ecdat::Fair (Fair.rda): 601 rows, 9 columns, loaded in 0.001 s, 178464 bytes allocated, 2334 pool allocs, 2 malloc() calls
[ Info: Ecdat::Fatality (Fatality.rda): 336 rows, 10 columns, loaded in 0.000 s, 140448 bytes allocated, 1528 pool allocs, 2 malloc() calls
[ Info: Ecdat::Fishing (Fishing.rda): 1182 rows, 12 columns, loaded in 0.001 s, 271136 bytes allocated, 4021 pool allocs, 2 malloc() calls
[ Info: Ecdat::Forward (Forward.csv.gz): 276 rows, 9 columns, loaded in 0.284 s, 23727398 bytes allocated, 429352 pool allocs, 2 malloc() calls
[ Info: Ecdat::FriendFoe (FriendFoe.rda): 227 rows, 13 columns, loaded in 0.001 s, 209968 bytes allocated, 1740 pool allocs, 2 malloc() calls
[ Info: Ecdat::Garch (Garch.csv.gz): 1867 rows, 8 columns, loaded in 0.298 s, 26731408 bytes allocated, 510386 pool allocs, 9 malloc() calls
[ Info: Ecdat::Gasoline (Gasoline.rda): 342 rows, 6 columns, loaded in 0.001 s, 115792 bytes allocated, 1481 pool allocs, 2 malloc() calls
[ Info: Ecdat::Griliches (Griliches.rda): 758 rows, 20 columns, loaded in 0.001 s, 323328 bytes allocated, 3292 pool allocs, 2 malloc() calls
[ Info: Ecdat::Grunfeld (Grunfeld.csv.gz): 200 rows, 5 columns, loaded in 0.227 s, 19499108 bytes allocated, 355769 pool allocs, 2 malloc() calls
[ Info: Ecdat::HC (HC.rda): 250 rows, 18 columns, loaded in 0.001 s, 190592 bytes allocated, 1392 pool allocs, 2 malloc() calls
[ Info: Ecdat::HI (HI.rda): 22272 rows, 13 columns, loaded in 0.014 s, 3272880 bytes allocated, 67848 pool allocs, 23 malloc() calls
[ Info: Ecdat::Hdma (Hdma.rda): 2381 rows, 13 columns, loaded in 0.002 s, 514048 bytes allocated, 8041 pool allocs, 9 malloc() calls
[ Info: Ecdat::Heating (Heating.rda): 900 rows, 21 columns, loaded in 0.004 s, 309264 bytes allocated, 3396 pool allocs, 2 malloc() calls
[ Info: Ecdat::Hedonic (Hedonic.rda): 506 rows, 15 columns, loaded in 0.001 s, 178032 bytes allocated, 2014 pool allocs, 2 malloc() calls
[ Info: Ecdat::Housing (Housing.rda): 546 rows, 12 columns, loaded in 0.001 s, 237968 bytes allocated, 2540 pool allocs, 2 malloc() calls
[ Info: Ecdat::Icecream (Icecream.csv.gz): 30 rows, 4 columns, loaded in 0.197 s, 17490635 bytes allocated, 319103 pool allocs, 2 malloc() calls
[ Info: Ecdat::Journals (Journals.rda): 180 rows, 10 columns, loaded in 0.001 s, 175920 bytes allocated, 2006 pool allocs, 2 malloc() calls
[ Info: Ecdat::Kakadu (Kakadu.rda): 1827 rows, 22 columns, loaded in 0.001 s, 524000 bytes allocated, 6560 pool allocs, 2 malloc() calls
[ Info: Ecdat::Ketchup (Ketchup.rda): 4956 rows, 7 columns, loaded in 0.001 s, 660080 bytes allocated, 15294 pool allocs, 10 malloc() calls
[ Info: Ecdat::Klein (Klein.csv.gz): 22 rows, 10 columns, loaded in 0.283 s, 22988003 bytes allocated, 416894 pool allocs, 2 malloc() calls
[ Info: Ecdat::LaborSupply (LaborSupply.csv.gz): 5320 rows, 7 columns, loaded in 0.258 s, 23580692 bytes allocated, 476908 pool allocs, 9 malloc() calls
[ Info: Ecdat::Labour (Labour.csv.gz): 569 rows, 4 columns, loaded in 0.210 s, 17250634 bytes allocated, 314239 pool allocs, 2 malloc() calls
[ Info: Ecdat::MCAS (MCAS.csv.gz): 220 rows, 17 columns, loaded in 0.609 s, 57234670 bytes allocated, 998972 pool allocs, 2 malloc() calls
[ Info: Ecdat::Males (Males.rda): 4360 rows, 12 columns, loaded in 0.001 s, 748192 bytes allocated, 14146 pool allocs, 15 malloc() calls
[ Info: Ecdat::Mathlevel (Mathlevel.rda): 609 rows, 8 columns, loaded in 0.001 s, 195904 bytes allocated, 2538 pool allocs, 2 malloc() calls
[ Info: Ecdat::MedExp (MedExp.rda): 5574 rows, 15 columns, loaded in 0.002 s, 1113984 bytes allocated, 17648 pool allocs, 18 malloc() calls
[ Info: Ecdat::Metal (Metal.csv.gz): 27 rows, 3 columns, loaded in 0.212 s, 17335008 bytes allocated, 317336 pool allocs, 2 malloc() calls
[ Info: Ecdat::Mode (Mode.rda): 453 rows, 9 columns, loaded in 0.001 s, 135856 bytes allocated, 1782 pool allocs, 2 malloc() calls
[ Info: Ecdat::ModeChoice (ModeChoice.csv.gz): 840 rows, 7 columns, loaded in 0.166 s, 13654856 bytes allocated, 249441 pool allocs, 2 malloc() calls
[ Info: Ecdat::Mofa (Mofa.csv.gz): 50 rows, 6 columns, loaded in 0.255 s, 22406013 bytes allocated, 409821 pool allocs, 2 malloc() calls
[ Info: Ecdat::Mroz (Mroz.rda): 753 rows, 18 columns, loaded in 0.001 s, 268864 bytes allocated, 2939 pool allocs, 2 malloc() calls
[ Info: Ecdat::MunExp (MunExp.csv.gz): 2385 rows, 5 columns, loaded in 0.142 s, 11923108 bytes allocated, 229824 pool allocs, 7 malloc() calls
[ Info: Ecdat::NaturalPark (NaturalPark.rda): 312 rows, 7 columns, loaded in 0.000 s, 137808 bytes allocated, 1449 pool allocs, 2 malloc() calls
[ Info: Ecdat::Nerlove (Nerlove.csv.gz): 159 rows, 8 columns, loaded in 0.238 s, 20741441 bytes allocated, 377190 pool allocs, 2 malloc() calls
[ Info: Ecdat::OFP (OFP.rda): 4406 rows, 19 columns, loaded in 0.002 s, 935440 bytes allocated, 14405 pool allocs, 22 malloc() calls
[ Info: Ecdat::Oil (Oil.csv.gz): 53 rows, 11 columns, loaded in 0.299 s, 24115913 bytes allocated, 437435 pool allocs, 2 malloc() calls
[ Info: Ecdat::PSID (PSID.rda): 4856 rows, 8 columns, loaded in 0.001 s, 637696 bytes allocated, 15049 pool allocs, 12 malloc() calls
[ Info: Ecdat::Participation (Participation.rda): 872 rows, 7 columns, loaded in 0.001 s, 193200 bytes allocated, 3113 pool allocs, 2 malloc() calls
[ Info: Ecdat::PatentsHGH (PatentsHGH.rda): 1730 rows, 18 columns, loaded in 0.001 s, 424160 bytes allocated, 5779 pool allocs, 2 malloc() calls
[ Info: Ecdat::PatentsRD (PatentsRD.csv.gz): 1629 rows, 7 columns, loaded in 0.238 s, 20020809 bytes allocated, 376509 pool allocs, 2 malloc() calls
[ Info: Ecdat::Pound (Pound.csv.gz): 778 rows, 4 columns, loaded in 0.001 s, 442544 bytes allocated, 9496 pool allocs, 2 malloc() calls
[ Info: Ecdat::Produc (Produc.rda): 816 rows, 10 columns, loaded in 0.001 s, 205824 bytes allocated, 3084 pool allocs, 2 malloc() calls
[ Info: Ecdat::RetSchool (RetSchool.csv.gz): 5225 rows, 17 columns, loaded in 0.527 s, 52003610 bytes allocated, 1010024 pool allocs, 19 malloc() calls
[ Info: Ecdat::SP500 (SP500.csv.gz): 2783 rows, 1 columns, loaded in 0.247 s, 21898018 bytes allocated, 401666 pool allocs, 4 malloc() calls
[ Info: Ecdat::Schooling (Schooling.rda): 3010 rows, 28 columns, loaded in 0.002 s, 1007424 bytes allocated, 11044 pool allocs, 6 malloc() calls
[ Info: Ecdat::Somerville (Somerville.rda): 659 rows, 8 columns, loaded in 0.001 s, 175888 bytes allocated, 2486 pool allocs, 2 malloc() calls
[ Info: Ecdat::Star (Star.rda): 5748 rows, 8 columns, loaded in 0.001 s, 750288 bytes allocated, 17933 pool allocs, 11 malloc() calls
[ Info: Ecdat::Strike (Strike.csv.gz): 62 rows, 2 columns, loaded in 0.196 s, 17053909 bytes allocated, 311077 pool allocs, 2 malloc() calls
[ Info: Ecdat::StrikeDur (StrikeDur.csv.gz): 566 rows, 2 columns, loaded in 0.090 s, 7944274 bytes allocated, 144763 pool allocs, 2 malloc() calls
[ Info: Ecdat::StrikeNb (StrikeNb.csv.gz): 108 rows, 3 columns, loaded in 0.206 s, 16853001 bytes allocated, 307322 pool allocs, 2 malloc() calls
[ Info: Ecdat::SumHes (SumHes.rda): 3250 rows, 7 columns, loaded in 0.001 s, 477632 bytes allocated, 10824 pool allocs, 4 malloc() calls
[ Info: Ecdat::Tobacco (Tobacco.rda): 2724 rows, 9 columns, loaded in 0.001 s, 435856 bytes allocated, 8708 pool allocs, 6 malloc() calls
[ Info: Ecdat::Train (Train.rda): 2929 rows, 11 columns, loaded in 0.001 s, 463344 bytes allocated, 9292 pool allocs, 3 malloc() calls
[ Info: Ecdat::TranspEq (TranspEq.csv.gz): 25 rows, 5 columns, loaded in 0.210 s, 18803623 bytes allocated, 343585 pool allocs, 2 malloc() calls
[ Info: Ecdat::Treatment (Treatment.rda): 2675 rows, 10 columns, loaded in 0.001 s, 427408 bytes allocated, 8479 pool allocs, 6 malloc() calls
[ Info: Ecdat::Tuna (Tuna.rda): 13705 rows, 8 columns, loaded in 0.003 s, 1795184 bytes allocated, 41558 pool allocs, 11 malloc() calls
[ Info: Ecdat::UnempDur (UnempDur.rda): 3343 rows, 11 columns, loaded in 0.001 s, 537776 bytes allocated, 10522 pool allocs, 6 malloc() calls
[ Info: Ecdat::Unemployment (Unemployment.rda): 452 rows, 12 columns, loaded in 0.001 s, 217152 bytes allocated, 2188 pool allocs, 2 malloc() calls
[ Info: Ecdat::University (University.csv.gz): 62 rows, 17 columns, loaded in 0.510 s, 46674144 bytes allocated, 802762 pool allocs, 2 malloc() calls
[ Info: Ecdat::VietNamI (VietNamI.rda): 27765 rows, 12 columns, loaded in 0.007 s, 3700704 bytes allocated, 83819 pool allocs, 16 malloc() calls
[ Info: Ecdat::Wages (Wages.rda): 4165 rows, 12 columns, loaded in 0.002 s, 718992 bytes allocated, 13474 pool allocs, 15 malloc() calls
[ Info: Ecdat::Wages1 (Wages1.rda): 3294 rows, 4 columns, loaded in 0.001 s, 384400 bytes allocated, 10258 pool allocs, 4 malloc() calls
[ Info: Ecdat::Workinghours (Workinghours.rda): 3382 rows, 12 columns, loaded in 0.001 s, 534992 bytes allocated, 10676 pool allocs, 3 malloc() calls
[ Info: Ecdat::Yen (Yen.csv.gz): 778 rows, 4 columns, loaded in 0.002 s, 442528 bytes allocated, 9496 pool allocs, 2 malloc() calls
[ Info: Ecdat::Yogurt (Yogurt.rda): 2412 rows, 10 columns, loaded in 0.001 s, 406224 bytes allocated, 7710 pool allocs, 7 malloc() calls
[ Info: Ecdat::incomeInequality (incomeInequality.csv.gz): 66 rows, 22 columns, loaded in 7.230 s, 582754445 bytes allocated, 9222757 pool allocs, 2 malloc() calls
[ Info: HSAUR::BCG (BCG.csv.gz): 13 rows, 7 columns, loaded in 0.162 s, 12723883 bytes allocated, 229608 pool allocs, 2 malloc() calls
[ Info: HSAUR::BtheB (BtheB.rda): 100 rows, 8 columns, loaded in 0.000 s, 133376 bytes allocated, 919 pool allocs, 2 malloc() calls
[ Info: HSAUR::CYGOB1 (CYGOB1.csv.gz): 47 rows, 2 columns, loaded in 0.193 s, 16341283 bytes allocated, 299334 pool allocs, 2 malloc() calls
[ Info: HSAUR::Forbes2000 (Forbes2000.rda): 2000 rows, 8 columns, loaded in 0.001 s, 514752 bytes allocated, 12848 pool allocs, 3 malloc() calls
[ Info: HSAUR::GHQ (GHQ.rda): 22 rows, 4 columns, loaded in 0.000 s, 83696 bytes allocated, 445 pool allocs, 2 malloc() calls
[ Info: HSAUR::Lanza (Lanza.rda): 198 rows, 3 columns, loaded in 0.000 s, 112528 bytes allocated, 1131 pool allocs, 2 malloc() calls
[ Info: HSAUR::agefat (agefat.rda): 25 rows, 3 columns, loaded in 0.000 s, 74656 bytes allocated, 432 pool allocs, 2 malloc() calls
[ Info: HSAUR::aspirin (aspirin.csv.gz): 7 rows, 5 columns, loaded in 0.225 s, 18337189 bytes allocated, 333830 pool allocs, 2 malloc() calls
[ Info: HSAUR::birthdeathrates (birthdeathrates.csv.gz): 69 rows, 3 columns, loaded in 0.210 s, 16555964 bytes allocated, 302189 pool allocs, 2 malloc() calls
[ Info: HSAUR::bladdercancer (bladdercancer.rda): 31 rows, 3 columns, loaded in 0.000 s, 79568 bytes allocated, 455 pool allocs, 2 malloc() calls
[ Info: HSAUR::clouds (clouds.rda): 24 rows, 7 columns, loaded in 0.000 s, 90704 bytes allocated, 558 pool allocs, 2 malloc() calls
[ Info: HSAUR::epilepsy (epilepsy.rda): 236 rows, 7 columns, loaded in 0.000 s, 161824 bytes allocated, 2238 pool allocs, 2 malloc() calls
[ Info: HSAUR::foster (foster.rda): 61 rows, 3 columns, loaded in 0.000 s, 87152 bytes allocated, 632 pool allocs, 2 malloc() calls
[ Info: HSAUR::heptathlon (heptathlon.csv.gz): 25 rows, 9 columns, loaded in 0.273 s, 23435802 bytes allocated, 426818 pool allocs, 2 malloc() calls
[ Info: HSAUR::mastectomy (mastectomy.rda): 44 rows, 3 columns, loaded in 0.000 s, 76256 bytes allocated, 490 pool allocs, 2 malloc() calls
[ Info: HSAUR::meteo (meteo.csv.gz): 11 rows, 6 columns, loaded in 0.208 s, 19230530 bytes allocated, 350556 pool allocs, 2 malloc() calls
[ Info: HSAUR::orallesions (orallesions.csv.gz): 8 rows, 4 columns, loaded in 0.220 s, 18113794 bytes allocated, 330018 pool allocs, 2 malloc() calls
[ Info: HSAUR::phosphate (phosphate.rda): 33 rows, 9 columns, loaded in 0.000 s, 75936 bytes allocated, 523 pool allocs, 2 malloc() calls
[ Info: HSAUR::pistonrings (pistonrings.csv.gz): 4 rows, 4 columns, loaded in 0.123 s, 10594781 bytes allocated, 188481 pool allocs, 3 malloc() calls
[ Info: HSAUR::planets (planets.csv.gz): 101 rows, 3 columns, loaded in 0.109 s, 9532890 bytes allocated, 172098 pool allocs, 2 malloc() calls
[ Info: HSAUR::plasma (plasma.rda): 32 rows, 4 columns, loaded in 0.000 s, 78240 bytes allocated, 562 pool allocs, 2 malloc() calls
[ Info: HSAUR::polyps (polyps.rda): 20 rows, 3 columns, loaded in 0.000 s, 78496 bytes allocated, 422 pool allocs, 2 malloc() calls
[ Info: HSAUR::polyps3 (polyps3.rda): 22 rows, 5 columns, loaded in 0.000 s, 97568 bytes allocated, 538 pool allocs, 2 malloc() calls
[ Info: HSAUR::pottery (pottery.csv.gz): 45 rows, 9 columns, loaded in 0.205 s, 15407388 bytes allocated, 277994 pool allocs, 2 malloc() calls
[ Info: HSAUR::rearrests (rearrests.csv.gz): 2 rows, 3 columns, loaded in 0.179 s, 15954933 bytes allocated, 290550 pool allocs, 2 malloc() calls
[ Info: HSAUR::respiratory (respiratory.rda): 555 rows, 8 columns, loaded in 0.007 s, 258896 bytes allocated, 4613 pool allocs, 2 malloc() calls
[ Info: HSAUR::roomwidth (roomwidth.rda): 113 rows, 2 columns, loaded in 0.000 s, 81008 bytes allocated, 684 pool allocs, 2 malloc() calls
[ Info: HSAUR::schizophrenia (schizophrenia.rda): 251 rows, 2 columns, loaded in 0.000 s, 92320 bytes allocated, 1097 pool allocs, 2 malloc() calls
[ Info: HSAUR::schizophrenia2 (schizophrenia2.rda): 220 rows, 5 columns, loaded in 0.000 s, 145568 bytes allocated, 2041 pool allocs, 2 malloc() calls
[ Info: HSAUR::schooldays (schooldays.rda): 154 rows, 5 columns, loaded in 0.000 s, 128128 bytes allocated, 1094 pool allocs, 2 malloc() calls
[ Info: HSAUR::skulls (skulls.rda): 150 rows, 5 columns, loaded in 0.000 s, 101408 bytes allocated, 861 pool allocs, 2 malloc() calls
[ Info: HSAUR::smoking (smoking.csv.gz): 26 rows, 5 columns, loaded in 0.124 s, 10800461 bytes allocated, 194988 pool allocs, 2 malloc() calls
[ Info: HSAUR::students (students.rda): 35 rows, 3 columns, loaded in 0.000 s, 79968 bytes allocated, 471 pool allocs, 2 malloc() calls
[ Info: HSAUR::suicides (suicides.csv.gz): 2 rows, 3 columns, loaded in 0.102 s, 9482306 bytes allocated, 170953 pool allocs, 2 malloc() calls
[ Info: HSAUR::toothpaste (toothpaste.csv.gz): 9 rows, 7 columns, loaded in 0.237 s, 19254505 bytes allocated, 350002 pool allocs, 2 malloc() calls
[ Info: HSAUR::voting (voting.csv.gz): 15 rows, 16 columns, loaded in 0.494 s, 46557797 bytes allocated, 825409 pool allocs, 2 malloc() calls
[ Info: HSAUR::water (water.rda): 61 rows, 4 columns, loaded in 0.000 s, 87248 bytes allocated, 742 pool allocs, 2 malloc() calls
[ Info: HSAUR::watervoles (watervoles.csv.gz): 14 rows, 15 columns, loaded in 0.454 s, 41553527 bytes allocated, 743525 pool allocs, 4 malloc() calls
[ Info: HSAUR::waves (waves.csv.gz): 18 rows, 2 columns, loaded in 0.104 s, 8521223 bytes allocated, 153697 pool allocs, 2 malloc() calls
[ Info: HSAUR::weightgain (weightgain.rda): 40 rows, 3 columns, loaded in 0.000 s, 89040 bytes allocated, 558 pool allocs, 2 malloc() calls
[ Info: HSAUR::womensrole (womensrole.rda): 42 rows, 4 columns, loaded in 0.000 s, 85552 bytes allocated, 505 pool allocs, 2 malloc() calls
[ Info: HistData::Arbuthnot (Arbuthnot.csv.gz): 82 rows, 7 columns, loaded in 0.144 s, 12858864 bytes allocated, 231917 pool allocs, 2 malloc() calls
[ Info: HistData::Bowley (Bowley.csv.gz): 45 rows, 2 columns, loaded in 0.084 s, 7901906 bytes allocated, 142491 pool allocs, 2 malloc() calls
[ Info: HistData::Cavendish (Cavendish.csv.gz): 29 rows, 3 columns, loaded in 0.127 s, 9499002 bytes allocated, 171470 pool allocs, 2 malloc() calls
[ Info: HistData::ChestSizes (ChestSizes.csv.gz): 16 rows, 2 columns, loaded in 0.175 s, 15138857 bytes allocated, 277055 pool allocs, 2 malloc() calls
[ Info: HistData::CushnyPeebles (CushnyPeebles.csv.gz): 11 rows, 4 columns, loaded in 0.193 s, 17102127 bytes allocated, 312544 pool allocs, 2 malloc() calls
[ Info: HistData::CushnyPeeblesN (CushnyPeeblesN.csv.gz): 11 rows, 4 columns, loaded in 0.151 s, 10441805 bytes allocated, 188415 pool allocs, 2 malloc() calls
[ Info: HistData::Dactyl (Dactyl.rda): 60 rows, 3 columns, loaded in 0.033 s, 2879060 bytes allocated, 51806 pool allocs, 2 malloc() calls
[ Info: HistData::DrinksWages (DrinksWages.rda): 70 rows, 6 columns, loaded in 0.002 s, 89904 bytes allocated, 617 pool allocs, 2 malloc() calls
[ Info: HistData::Fingerprints (Fingerprints.rda): 36 rows, 3 columns, loaded in 0.000 s, 87504 bytes allocated, 479 pool allocs, 2 malloc() calls
[ Info: HistData::Galton (Galton.csv.gz): 928 rows, 2 columns, loaded in 0.105 s, 8608359 bytes allocated, 156257 pool allocs, 2 malloc() calls
[ Info: HistData::GaltonFamilies (GaltonFamilies.rda): 934 rows, 8 columns, loaded in 0.001 s, 179536 bytes allocated, 1322 pool allocs, 2 malloc() calls
[ Info: HistData::Guerry (Guerry.rda): 86 rows, 23 columns, loaded in 0.001 s, 190304 bytes allocated, 1050 pool allocs, 2 malloc() calls
[ Info: HistData::Jevons (Jevons.csv.gz): 50 rows, 4 columns, loaded in 0.138 s, 10461469 bytes allocated, 188891 pool allocs, 2 malloc() calls
[ Info: HistData::Langren.all (Langren.all.rda): 61 rows, 4 columns, loaded in 0.000 s, 90464 bytes allocated, 519 pool allocs, 2 malloc() calls
[ Info: HistData::Langren1644 (Langren1644.rda): 12 rows, 9 columns, loaded in 0.000 s, 97088 bytes allocated, 651 pool allocs, 2 malloc() calls
[ Info: HistData::Macdonell (Macdonell.csv.gz): 924 rows, 3 columns, loaded in 0.168 s, 15524433 bytes allocated, 286409 pool allocs, 2 malloc() calls
[ Info: HistData::MacdonellDF (MacdonellDF.csv.gz): 3000 rows, 2 columns, loaded in 0.096 s, 7513073 bytes allocated, 137629 pool allocs, 4 malloc() calls
[ Info: HistData::Michelson (Michelson.csv.gz): 100 rows, 1 columns, loaded in 0.163 s, 15354952 bytes allocated, 281604 pool allocs, 2 malloc() calls
[ Info: HistData::MichelsonSets (MichelsonSets.csv.gz): 20 rows, 5 columns, loaded in 0.227 s, 18056178 bytes allocated, 329733 pool allocs, 2 malloc() calls
[ Info: HistData::Minard.cities (Minard.cities.csv.gz): 20 rows, 3 columns, loaded in 0.183 s, 15980677 bytes allocated, 291101 pool allocs, 2 malloc() calls
[ Info: HistData::Minard.temp (Minard.temp.csv.gz): 9 rows, 4 columns, loaded in 0.195 s, 17170977 bytes allocated, 314394 pool allocs, 2 malloc() calls
[ Info: HistData::Minard.troops (Minard.troops.rda): 51 rows, 5 columns, loaded in 0.000 s, 79504 bytes allocated, 384 pool allocs, 2 malloc() calls
[ Info: HistData::Nightingale (Nightingale.rda): 24 rows, 10 columns, loaded in 0.003 s, 181918 bytes allocated, 1806 pool allocs, 2 malloc() calls
[ Info: HistData::OldMaps (OldMaps.rda): 468 rows, 6 columns, loaded in 0.000 s, 103200 bytes allocated, 442 pool allocs, 2 malloc() calls
[ Info: HistData::PearsonLee (PearsonLee.rda): 746 rows, 6 columns, loaded in 0.000 s, 125696 bytes allocated, 558 pool allocs, 2 malloc() calls
[ Info: HistData::PolioTrials (PolioTrials.rda): 8 rows, 6 columns, loaded in 0.000 s, 101104 bytes allocated, 503 pool allocs, 2 malloc() calls
[ Info: HistData::Prostitutes (Prostitutes.rda): 516 rows, 5 columns, loaded in 0.000 s, 107872 bytes allocated, 458 pool allocs, 2 malloc() calls
[ Info: HistData::Pyx (Pyx.rda): 72 rows, 4 columns, loaded in 0.000 s, 102400 bytes allocated, 594 pool allocs, 2 malloc() calls
[ Info: HistData::Quarrels (Quarrels.rda): 779 rows, 84 columns, loaded in 0.002 s, 1360144 bytes allocated, 6587 pool allocs, 2 malloc() calls
[ Info: HistData::Snow.deaths (Snow.deaths.csv.gz): 578 rows, 3 columns, loaded in 0.200 s, 16187980 bytes allocated, 295136 pool allocs, 2 malloc() calls
[ Info: HistData::Snow.polygons (Snow.polygons.csv.gz): 54 rows, 3 columns, loaded in 0.092 s, 8228109 bytes allocated, 148940 pool allocs, 2 malloc() calls
[ Info: HistData::Snow.pumps (Snow.pumps.csv.gz): 13 rows, 4 columns, loaded in 0.199 s, 17300078 bytes allocated, 315735 pool allocs, 2 malloc() calls
[ Info: HistData::Snow.streets (Snow.streets.csv.gz): 1241 rows, 4 columns, loaded in 0.175 s, 9494531 bytes allocated, 177857 pool allocs, 2 malloc() calls
[ Info: HistData::Wheat.monarchs (Wheat.monarchs.csv.gz): 12 rows, 4 columns, loaded in 0.127 s, 10461517 bytes allocated, 188635 pool allocs, 2 malloc() calls
[ Info: HistData::Yeast (Yeast.rda): 36 rows, 3 columns, loaded in 0.000 s, 77392 bytes allocated, 368 pool allocs, 2 malloc() calls
[ Info: HistData::YeastD.mat (YeastD.mat.csv.gz): 20 rows, 20 columns, loaded in 2.033 s, 175869997 bytes allocated, 2852932 pool allocs, 3 malloc() calls
[ Info: HistData::ZeaMays (ZeaMays.rda): 15 rows, 5 columns, loaded in 0.000 s, 73968 bytes allocated, 387 pool allocs, 2 malloc() calls
[ Info: ISLR::Auto (Auto.rda): 392 rows, 9 columns, loaded in 0.000 s, 143152 bytes allocated, 2682 pool allocs, 2 malloc() calls
[ Info: ISLR::Caravan (Caravan.rda): 5822 rows, 86 columns, loaded in 0.009 s, 4530384 bytes allocated, 18910 pool allocs, 89 malloc() calls
[ Info: ISLR::Carseats (Carseats.rda): 400 rows, 11 columns, loaded in 0.001 s, 162080 bytes allocated, 1811 pool allocs, 2 malloc() calls
[ Info: ISLR::College (College.rda): 777 rows, 19 columns, loaded in 0.001 s, 258176 bytes allocated, 2883 pool allocs, 2 malloc() calls
[ Info: ISLR::Default (Default.rda): 10000 rows, 4 columns, loaded in 0.001 s, 341664 bytes allocated, 447 pool allocs, 6 malloc() calls
[ Info: ISLR::Hitters (Hitters.rda): 322 rows, 20 columns, loaded in 0.001 s, 233952 bytes allocated, 1772 pool allocs, 2 malloc() calls
[ Info: ISLR::OJ (OJ.rda): 1070 rows, 18 columns, loaded in 0.007 s, 317952 bytes allocated, 3833 pool allocs, 2 malloc() calls
[ Info: ISLR::Portfolio (Portfolio.rda): 100 rows, 2 columns, loaded in 0.000 s, 55488 bytes allocated, 252 pool allocs, 2 malloc() calls
[ Info: ISLR::Smarket (Smarket.rda): 1250 rows, 9 columns, loaded in 0.001 s, 158080 bytes allocated, 424 pool allocs, 2 malloc() calls
[ Info: ISLR::Wage (Wage.rda): 3000 rows, 12 columns, loaded in 0.001 s, 382992 bytes allocated, 1096 pool allocs, 4 malloc() calls
[ Info: ISLR::Weekly (Weekly.rda): 1089 rows, 9 columns, loaded in 0.001 s, 147088 bytes allocated, 424 pool allocs, 2 malloc() calls
[ Info: KMsurv::aids (aids.csv.gz): 295 rows, 3 columns, loaded in 0.119 s, 9737155 bytes allocated, 173791 pool allocs, 2 malloc() calls
[ Info: KMsurv::alloauto (alloauto.csv.gz): 101 rows, 3 columns, loaded in 0.187 s, 15989612 bytes allocated, 291607 pool allocs, 2 malloc() calls
[ Info: KMsurv::allograft (allograft.csv.gz): 34 rows, 4 columns, loaded in 0.130 s, 10451325 bytes allocated, 188698 pool allocs, 2 malloc() calls
[ Info: KMsurv::azt (azt.csv.gz): 45 rows, 4 columns, loaded in 0.113 s, 9183047 bytes allocated, 166055 pool allocs, 2 malloc() calls
[ Info: KMsurv::baboon (baboon.csv.gz): 152 rows, 3 columns, loaded in 0.102 s, 8917255 bytes allocated, 161839 pool allocs, 2 malloc() calls
[ Info: KMsurv::bcdeter (bcdeter.csv.gz): 95 rows, 3 columns, loaded in 0.126 s, 9496826 bytes allocated, 171785 pool allocs, 2 malloc() calls
[ Info: KMsurv::bfeed (bfeed.csv.gz): 927 rows, 10 columns, loaded in 0.201 s, 15340921 bytes allocated, 283808 pool allocs, 2 malloc() calls
[ Info: KMsurv::bmt (bmt.csv.gz): 137 rows, 22 columns, loaded in 6.946 s, 571736739 bytes allocated, 9024454 pool allocs, 2 malloc() calls
[ Info: KMsurv::bnct (bnct.csv.gz): 30 rows, 3 columns, loaded in 0.121 s, 8829087 bytes allocated, 159691 pool allocs, 2 malloc() calls
[ Info: KMsurv::btrial (btrial.csv.gz): 45 rows, 3 columns, loaded in 0.119 s, 9487994 bytes allocated, 171215 pool allocs, 2 malloc() calls
[ Info: KMsurv::burn (burn.csv.gz): 154 rows, 18 columns, loaded in 0.340 s, 24282440 bytes allocated, 441440 pool allocs, 2 malloc() calls
[ Info: KMsurv::channing (channing.csv.gz): 462 rows, 6 columns, loaded in 0.213 s, 17275860 bytes allocated, 317980 pool allocs, 2 malloc() calls
[ Info: KMsurv::drug6mp (drug6mp.csv.gz): 21 rows, 5 columns, loaded in 0.139 s, 11420112 bytes allocated, 206085 pool allocs, 2 malloc() calls
[ Info: KMsurv::drughiv (drughiv.csv.gz): 34 rows, 3 columns, loaded in 0.101 s, 8183780 bytes allocated, 148343 pool allocs, 2 malloc() calls
[ Info: KMsurv::hodg (hodg.csv.gz): 43 rows, 6 columns, loaded in 0.155 s, 11749784 bytes allocated, 212629 pool allocs, 2 malloc() calls
[ Info: KMsurv::kidney (kidney.csv.gz): 119 rows, 3 columns, loaded in 0.088 s, 8215469 bytes allocated, 149410 pool allocs, 2 malloc() calls
[ Info: KMsurv::kidrecurr (kidrecurr.csv.gz): 38 rows, 10 columns, loaded in 0.268 s, 22276844 bytes allocated, 404792 pool allocs, 2 malloc() calls
[ Info: KMsurv::kidtran (kidtran.csv.gz): 863 rows, 6 columns, loaded in 0.143 s, 10690114 bytes allocated, 196683 pool allocs, 2 malloc() calls
[ Info: KMsurv::larynx (larynx.csv.gz): 90 rows, 5 columns, loaded in 0.236 s, 17364941 bytes allocated, 315689 pool allocs, 2 malloc() calls
[ Info: KMsurv::lung (lung.csv.gz): 25 rows, 4 columns, loaded in 0.118 s, 9156055 bytes allocated, 165810 pool allocs, 2 malloc() calls
[ Info: KMsurv::pneumon (pneumon.csv.gz): 3470 rows, 15 columns, loaded in 0.373 s, 34830864 bytes allocated, 682407 pool allocs, 17 malloc() calls
[ Info: KMsurv::psych (psych.csv.gz): 26 rows, 4 columns, loaded in 0.112 s, 8509820 bytes allocated, 154432 pool allocs, 2 malloc() calls
[ Info: KMsurv::rats (rats.csv.gz): 150 rows, 4 columns, loaded in 0.117 s, 9847346 bytes allocated, 178703 pool allocs, 2 malloc() calls
[ Info: KMsurv::std (std.rda): 877 rows, 24 columns, loaded in 0.001 s, 280192 bytes allocated, 805 pool allocs, 2 malloc() calls
[ Info: KMsurv::stddiag (stddiag.csv.gz): 25 rows, 2 columns, loaded in 0.104 s, 8519821 bytes allocated, 153693 pool allocs, 2 malloc() calls
[ Info: KMsurv::tongue (tongue.csv.gz): 80 rows, 3 columns, loaded in 0.107 s, 8197988 bytes allocated, 148754 pool allocs, 2 malloc() calls
[ Info: KMsurv::twins (twins.csv.gz): 24 rows, 4 columns, loaded in 0.102 s, 8509468 bytes allocated, 154409 pool allocs, 2 malloc() calls
[ Info: MASS::Aids2 (Aids2.rda): 2843 rows, 7 columns, loaded in 0.001 s, 215472 bytes allocated, 686 pool allocs, 2 malloc() calls
[ Info: MASS::Animals (Animals.csv.gz): 28 rows, 3 columns, loaded in 0.117 s, 9508738 bytes allocated, 171575 pool allocs, 2 malloc() calls
[ Info: MASS::Boston (Boston.csv.gz): 506 rows, 14 columns, loaded in 0.446 s, 38913008 bytes allocated, 709603 pool allocs, 2 malloc() calls
[ Info: MASS::Cars93 (Cars93.rda): 93 rows, 27 columns, loaded in 0.001 s, 251552 bytes allocated, 1925 pool allocs, 2 malloc() calls
[ Info: MASS::Cushings (Cushings.rda): 27 rows, 4 columns, loaded in 0.000 s, 73392 bytes allocated, 536 pool allocs, 2 malloc() calls
[ Info: MASS::DDT (DDT.csv.gz): 15 rows, 1 columns, loaded in 0.091 s, 7542260 bytes allocated, 136034 pool allocs, 2 malloc() calls
[ Info: MASS::GAGurine (GAGurine.csv.gz): 314 rows, 2 columns, loaded in 0.100 s, 8563735 bytes allocated, 155016 pool allocs, 2 malloc() calls
[ Info: MASS::Insurance (Insurance.rda): 64 rows, 5 columns, loaded in 0.000 s, 106656 bytes allocated, 576 pool allocs, 2 malloc() calls
[ Info: MASS::Melanoma (Melanoma.csv.gz): 205 rows, 7 columns, loaded in 0.228 s, 18092707 bytes allocated, 331464 pool allocs, 2 malloc() calls
[ Info: MASS::OME (OME.rda): 1097 rows, 7 columns, loaded in 0.001 s, 139040 bytes allocated, 510 pool allocs, 2 malloc() calls
[ Info: MASS::Pima.te (Pima.te.rda): 332 rows, 8 columns, loaded in 0.000 s, 106720 bytes allocated, 433 pool allocs, 2 malloc() calls
[ Info: MASS::Pima.tr (Pima.tr.rda): 200 rows, 8 columns, loaded in 0.000 s, 101040 bytes allocated, 435 pool allocs, 2 malloc() calls
[ Info: MASS::Pima.tr2 (Pima.tr2.rda): 300 rows, 8 columns, loaded in 0.000 s, 112016 bytes allocated, 447 pool allocs, 2 malloc() calls
[ Info: MASS::Rabbit (Rabbit.rda): 60 rows, 5 columns, loaded in 0.001 s, 98352 bytes allocated, 580 pool allocs, 2 malloc() calls
[ Info: MASS::Rubber (Rubber.csv.gz): 30 rows, 3 columns, loaded in 0.138 s, 9479322 bytes allocated, 171084 pool allocs, 2 malloc() calls
[ Info: MASS::SP500 (SP500.csv.gz): 2780 rows, 1 columns, loaded in 0.001 s, 294240 bytes allocated, 3485 pool allocs, 3 malloc() calls
[ Info: MASS::Sitka (Sitka.rda): 395 rows, 4 columns, loaded in 0.000 s, 86032 bytes allocated, 371 pool allocs, 2 malloc() calls
[ Info: MASS::Sitka89 (Sitka89.rda): 632 rows, 4 columns, loaded in 0.000 s, 90816 bytes allocated, 374 pool allocs, 2 malloc() calls
[ Info: MASS::Skye (Skye.csv.gz): 23 rows, 3 columns, loaded in 0.124 s, 9478106 bytes allocated, 171021 pool allocs, 2 malloc() calls
[ Info: MASS::Traffic (Traffic.rda): 184 rows, 4 columns, loaded in 0.000 s, 85088 bytes allocated, 377 pool allocs, 2 malloc() calls
[ Info: MASS::UScereal (UScereal.rda): 65 rows, 12 columns, loaded in 0.000 s, 108640 bytes allocated, 964 pool allocs, 2 malloc() calls
[ Info: MASS::UScrime (UScrime.csv.gz): 47 rows, 16 columns, loaded in 0.440 s, 38348602 bytes allocated, 673521 pool allocs, 2 malloc() calls
[ Info: MASS::VA (VA.rda): 137 rows, 8 columns, loaded in 0.000 s, 124240 bytes allocated, 605 pool allocs, 2 malloc() calls
[ Info: MASS::abbey (abbey.csv.gz): 31 rows, 1 columns, loaded in 0.088 s, 7543028 bytes allocated, 136078 pool allocs, 2 malloc() calls
[ Info: MASS::anorexia (anorexia.rda): 72 rows, 3 columns, loaded in 0.000 s, 69472 bytes allocated, 354 pool allocs, 2 malloc() calls
[ Info: MASS::bacteria (bacteria.rda): 220 rows, 6 columns, loaded in 0.000 s, 143120 bytes allocated, 916 pool allocs, 2 malloc() calls
[ Info: MASS::beav1 (beav1.csv.gz): 114 rows, 4 columns, loaded in 0.189 s, 15683484 bytes allocated, 286888 pool allocs, 2 malloc() calls
[ Info: MASS::beav2 (beav2.csv.gz): 100 rows, 4 columns, loaded in 0.001 s, 111952 bytes allocated, 1916 pool allocs, 2 malloc() calls
[ Info: MASS::biopsy (biopsy.rda): 699 rows, 11 columns, loaded in 0.001 s, 198032 bytes allocated, 2602 pool allocs, 2 malloc() calls
[ Info: MASS::birthwt (birthwt.csv.gz): 189 rows, 11 columns, loaded in 0.205 s, 15484552 bytes allocated, 282692 pool allocs, 2 malloc() calls
[ Info: MASS::cabbages (cabbages.rda): 60 rows, 4 columns, loaded in 0.000 s, 87360 bytes allocated, 452 pool allocs, 2 malloc() calls
[ Info: MASS::caith (caith.csv.gz): 4 rows, 6 columns, loaded in 0.249 s, 18880230 bytes allocated, 343170 pool allocs, 2 malloc() calls
[ Info: MASS::cats (cats.rda): 144 rows, 3 columns, loaded in 0.000 s, 70800 bytes allocated, 350 pool allocs, 2 malloc() calls
[ Info: MASS::cement (cement.csv.gz): 13 rows, 5 columns, loaded in 0.209 s, 17917752 bytes allocated, 325816 pool allocs, 2 malloc() calls
[ Info: MASS::chem (chem.csv.gz): 24 rows, 1 columns, loaded in 0.001 s, 56800 bytes allocated, 354 pool allocs, 2 malloc() calls
[ Info: MASS::coop (coop.rda): 252 rows, 4 columns, loaded in 0.000 s, 101760 bytes allocated, 563 pool allocs, 2 malloc() calls
[ Info: MASS::cpus (cpus.csv.gz): 209 rows, 9 columns, loaded in 0.278 s, 21437660 bytes allocated, 391624 pool allocs, 2 malloc() calls
[ Info: MASS::crabs (crabs.rda): 200 rows, 8 columns, loaded in 0.000 s, 99312 bytes allocated, 496 pool allocs, 2 malloc() calls
[ Info: MASS::eagles (eagles.rda): 8 rows, 5 columns, loaded in 0.000 s, 104144 bytes allocated, 546 pool allocs, 2 malloc() calls
[ Info: MASS::epil (epil.rda): 236 rows, 9 columns, loaded in 0.000 s, 108672 bytes allocated, 452 pool allocs, 2 malloc() calls
[ Info: MASS::farms (farms.rda): 20 rows, 4 columns, loaded in 0.000 s, 109072 bytes allocated, 637 pool allocs, 2 malloc() calls
[ Info: MASS::fgl (fgl.rda): 214 rows, 10 columns, loaded in 0.000 s, 88240 bytes allocated, 450 pool allocs, 2 malloc() calls
[ Info: MASS::forbes (forbes.csv.gz): 17 rows, 2 columns, loaded in 0.100 s, 8537319 bytes allocated, 153686 pool allocs, 2 malloc() calls
[ Info: MASS::galaxies (galaxies.csv.gz): 82 rows, 1 columns, loaded in 0.092 s, 7555524 bytes allocated, 136322 pool allocs, 2 malloc() calls
[ Info: MASS::gehan (gehan.rda): 42 rows, 4 columns, loaded in 0.000 s, 82352 bytes allocated, 377 pool allocs, 2 malloc() calls
[ Info: MASS::genotype (genotype.rda): 61 rows, 3 columns, loaded in 0.000 s, 82624 bytes allocated, 447 pool allocs, 2 malloc() calls
[ Info: MASS::geyser (geyser.csv.gz): 299 rows, 2 columns, loaded in 0.095 s, 8647037 bytes allocated, 155219 pool allocs, 2 malloc() calls
[ Info: MASS::gilgais (gilgais.csv.gz): 365 rows, 9 columns, loaded in 0.265 s, 22172432 bytes allocated, 405239 pool allocs, 2 malloc() calls
[ Info: MASS::hills (hills.csv.gz): 35 rows, 4 columns, loaded in 0.198 s, 17307502 bytes allocated, 315595 pool allocs, 2 malloc() calls
[ Info: MASS::housing (housing.rda): 72 rows, 5 columns, loaded in 0.000 s, 115168 bytes allocated, 641 pool allocs, 2 malloc() calls
[ Info: MASS::immer (immer.rda): 30 rows, 4 columns, loaded in 0.000 s, 82960 bytes allocated, 471 pool allocs, 2 malloc() calls
[ Info: MASS::leuk (leuk.rda): 33 rows, 3 columns, loaded in 0.000 s, 77136 bytes allocated, 360 pool allocs, 2 malloc() calls
[ Info: MASS::mammals (mammals.csv.gz): 62 rows, 3 columns, loaded in 0.095 s, 8247116 bytes allocated, 149297 pool allocs, 2 malloc() calls
[ Info: MASS::mcycle (mcycle.csv.gz): 133 rows, 2 columns, loaded in 0.111 s, 8546951 bytes allocated, 154420 pool allocs, 2 malloc() calls
[ Info: MASS::menarche (menarche.csv.gz): 25 rows, 3 columns, loaded in 0.098 s, 8848024 bytes allocated, 159862 pool allocs, 2 malloc() calls
[ Info: MASS::michelson (michelson.rda): 100 rows, 3 columns, loaded in 0.000 s, 90448 bytes allocated, 523 pool allocs, 2 malloc() calls
[ Info: MASS::minn38 (minn38.rda): 168 rows, 5 columns, loaded in 0.000 s, 117968 bytes allocated, 654 pool allocs, 2 malloc() calls
[ Info: MASS::motors (motors.csv.gz): 40 rows, 3 columns, loaded in 0.110 s, 8833167 bytes allocated, 159845 pool allocs, 2 malloc() calls
[ Info: MASS::muscle (muscle.rda): 60 rows, 4 columns, loaded in 0.000 s, 81616 bytes allocated, 804 pool allocs, 2 malloc() calls
[ Info: MASS::newcomb (newcomb.csv.gz): 66 rows, 1 columns, loaded in 0.001 s, 58288 bytes allocated, 461 pool allocs, 2 malloc() calls
[ Info: MASS::nlschools (nlschools.rda): 2287 rows, 6 columns, loaded in 0.001 s, 184672 bytes allocated, 1012 pool allocs, 3 malloc() calls
[ Info: MASS::npk (npk.rda): 24 rows, 5 columns, loaded in 0.000 s, 109440 bytes allocated, 633 pool allocs, 2 malloc() calls
[ Info: MASS::npr1 (npr1.csv.gz): 104 rows, 5 columns, loaded in 0.182 s, 17015450 bytes allocated, 311912 pool allocs, 2 malloc() calls
[ Info: MASS::oats (oats.rda): 72 rows, 4 columns, loaded in 0.000 s, 101280 bytes allocated, 557 pool allocs, 2 malloc() calls
[ Info: MASS::painters (painters.rda): 54 rows, 6 columns, loaded in 0.000 s, 97216 bytes allocated, 758 pool allocs, 2 malloc() calls
[ Info: MASS::petrol (petrol.rda): 32 rows, 6 columns, loaded in 0.000 s, 80368 bytes allocated, 428 pool allocs, 2 malloc() calls
[ Info: MASS::quine (quine.rda): 146 rows, 5 columns, loaded in 0.000 s, 116480 bytes allocated, 630 pool allocs, 2 malloc() calls
[ Info: MASS::road (road.csv.gz): 26 rows, 7 columns, loaded in 0.249 s, 20766187 bytes allocated, 379052 pool allocs, 2 malloc() calls
[ Info: MASS::rotifer (rotifer.csv.gz): 20 rows, 5 columns, loaded in 0.208 s, 17285565 bytes allocated, 314554 pool allocs, 2 malloc() calls
[ Info: MASS::ships (ships.rda): 40 rows, 5 columns, loaded in 0.000 s, 87824 bytes allocated, 406 pool allocs, 2 malloc() calls
[ Info: MASS::shoes (shoes.csv.gz): 10 rows, 2 columns, loaded in 0.132 s, 8511943 bytes allocated, 153638 pool allocs, 2 malloc() calls
[ Info: MASS::shrimp (shrimp.csv.gz): 18 rows, 1 columns, loaded in 0.001 s, 56448 bytes allocated, 336 pool allocs, 2 malloc() calls
[ Info: MASS::shuttle (shuttle.rda): 256 rows, 7 columns, loaded in 0.000 s, 159040 bytes allocated, 900 pool allocs, 2 malloc() calls
[ Info: MASS::snails (snails.rda): 96 rows, 6 columns, loaded in 0.000 s, 90304 bytes allocated, 406 pool allocs, 2 malloc() calls
[ Info: MASS::steam (steam.csv.gz): 14 rows, 2 columns, loaded in 0.100 s, 7872706 bytes allocated, 142252 pool allocs, 2 malloc() calls
[ Info: MASS::stormer (stormer.csv.gz): 23 rows, 3 columns, loaded in 0.185 s, 16002972 bytes allocated, 290968 pool allocs, 3 malloc() calls
[ Info: MASS::survey (survey.rda): 237 rows, 12 columns, loaded in 0.001 s, 185856 bytes allocated, 1012 pool allocs, 2 malloc() calls
[ Info: MASS::synth.te (synth.te.csv.gz): 1000 rows, 3 columns, loaded in 0.113 s, 9786035 bytes allocated, 178805 pool allocs, 2 malloc() calls
[ Info: MASS::synth.tr (synth.tr.csv.gz): 250 rows, 3 columns, loaded in 0.002 s, 194320 bytes allocated, 2927 pool allocs, 2 malloc() calls
[ Info: MASS::topo (topo.csv.gz): 52 rows, 3 columns, loaded in 0.098 s, 8860696 bytes allocated, 160351 pool allocs, 2 malloc() calls
[ Info: MASS::waders (waders.csv.gz): 15 rows, 20 columns, loaded in 1.963 s, 168896258 bytes allocated, 2723659 pool allocs, 2 malloc() calls
[ Info: MASS::whiteside (whiteside.rda): 56 rows, 3 columns, loaded in 0.000 s, 68992 bytes allocated, 350 pool allocs, 2 malloc() calls
[ Info: MASS::wtloss (wtloss.csv.gz): 52 rows, 2 columns, loaded in 0.146 s, 8536861 bytes allocated, 153883 pool allocs, 2 malloc() calls
[ Info: SASmixed::AvgDailyGain (AvgDailyGain.rda): 32 rows, 6 columns, loaded in 0.000 s, 100224 bytes allocated, 612 pool allocs, 2 malloc() calls
[ Info: SASmixed::Multilocation (Multilocation.rda): 108 rows, 7 columns, loaded in 0.001 s, 131472 bytes allocated, 1115 pool allocs, 2 malloc() calls
[ Info: SASmixed::SIMS (SIMS.rda): 3691 rows, 3 columns, loaded in 0.001 s, 428912 bytes allocated, 12195 pool allocs, 3 malloc() calls
[ Info: Zelig::PErisk (PErisk.csv.gz): 62 rows, 6 columns, loaded in 0.271 s, 18620167 bytes allocated, 340154 pool allocs, 2 malloc() calls
[ Info: Zelig::SupremeCourt (SupremeCourt.csv.gz): 43 rows, 9 columns, loaded in 0.349 s, 22009918 bytes allocated, 401449 pool allocs, 2 malloc() calls
[ Info: Zelig::Weimar (Weimar.csv.gz): 10 rows, 11 columns, loaded in 0.335 s, 24037337 bytes allocated, 435650 pool allocs, 2 malloc() calls
[ Info: Zelig::approval (approval.csv.gz): 65 rows, 8 columns, loaded in 0.288 s, 20292175 bytes allocated, 369384 pool allocs, 2 malloc() calls
[ Info: Zelig::bivariate (bivariate.csv.gz): 78 rows, 6 columns, loaded in 0.210 s, 11757304 bytes allocated, 213245 pool allocs, 2 malloc() calls
[ Info: Zelig::coalition (coalition.csv.gz): 314 rows, 7 columns, loaded in 0.236 s, 20021940 bytes allocated, 366391 pool allocs, 2 malloc() calls
[ Info: Zelig::coalition2 (coalition2.rda): 314 rows, 8 columns, loaded in 0.001 s, 134320 bytes allocated, 1437 pool allocs, 2 malloc() calls
[ Info: Zelig::eidat (eidat.csv.gz): 10 rows, 4 columns, loaded in 0.143 s, 10441261 bytes allocated, 188397 pool allocs, 2 malloc() calls
[ Info: Zelig::free1 (free1.rda): 450 rows, 12 columns, loaded in 0.001 s, 214096 bytes allocated, 3232 pool allocs, 2 malloc() calls
[ Info: Zelig::free2 (free2.rda): 450 rows, 12 columns, loaded in 0.001 s, 214096 bytes allocated, 3232 pool allocs, 2 malloc() calls
[ Info: Zelig::grunfeld (grunfeld.csv.gz): 20 rows, 7 columns, loaded in 0.256 s, 19742233 bytes allocated, 350562 pool allocs, 2 malloc() calls
[ Info: Zelig::hoff (hoff.csv.gz): 36 rows, 5 columns, loaded in 0.239 s, 17284749 bytes allocated, 315154 pool allocs, 2 malloc() calls
[ Info: Zelig::homerun (homerun.rda): 314 rows, 5 columns, loaded in 0.000 s, 125936 bytes allocated, 1433 pool allocs, 2 malloc() calls
[ Info: Zelig::immi1 (immi1.csv.gz): 2485 rows, 5 columns, loaded in 0.171 s, 12343871 bytes allocated, 237993 pool allocs, 7 malloc() calls
[ Info: Zelig::immi2 (immi2.csv.gz): 2485 rows, 5 columns, loaded in 0.003 s, 971440 bytes allocated, 32425 pool allocs, 7 malloc() calls
[ Info: Zelig::immi3 (immi3.csv.gz): 2485 rows, 5 columns, loaded in 0.004 s, 971440 bytes allocated, 32425 pool allocs, 7 malloc() calls
[ Info: Zelig::immi4 (immi4.csv.gz): 2485 rows, 5 columns, loaded in 0.003 s, 971440 bytes allocated, 32425 pool allocs, 7 malloc() calls
[ Info: Zelig::immi5 (immi5.csv.gz): 2485 rows, 5 columns, loaded in 0.003 s, 971440 bytes allocated, 32425 pool allocs, 7 malloc() calls
[ Info: Zelig::immigration (immigration.csv.gz): 2485 rows, 5 columns, loaded in 0.138 s, 10173950 bytes allocated, 198578 pool allocs, 7 malloc() calls
[ Info: Zelig::klein (klein.csv.gz): 21 rows, 14 columns, loaded in 0.368 s, 29800096 bytes allocated, 528691 pool allocs, 13 malloc() calls
andreasnoack commented 6 years ago

@JeffBezanson I think this might hang in subtyping. When interrupting the process I got

ds = convert(String, r[:Dataset]) = "mexico"
^C^C^C^C^C^CWARNING: Force throwing a SIGINT
Internal error: encountered unexpected error in runtime:
InterruptException()
has_free_typevars at /Users/osx/buildbot/slave/package_osx64/build/src/jltypes.c:150
subtype_tuple at /Users/osx/buildbot/slave/package_osx64/build/src/subtype.c:851 [inlined]
subtype at /Users/osx/buildbot/slave/package_osx64/build/src/subtype.c:997
exists_subtype at /Users/osx/buildbot/slave/package_osx64/build/src/subtype.c:1078 [inlined]
forall_exists_subtype at /Users/osx/buildbot/slave/package_osx64/build/src/subtype.c:1106
subtype_ccheck at /Users/osx/buildbot/slave/package_osx64/build/src/subtype.c:461
var_gt at /Users/osx/buildbot/slave/package_osx64/build/src/subtype.c:545
exists_subtype at /Users/osx/buildbot/slave/package_osx64/build/src/subtype.c:1078 [inlined]
forall_exists_subtype at /Users/osx/buildbot/slave/package_osx64/build/src/subtype.c:1106
forall_exists_equal at /Users/osx/buildbot/slave/package_osx64/build/src/subtype.c:1045
subtype at /Users/osx/buildbot/slave/package_osx64/build/src/subtype.c:1017
subtype_unionall at /Users/osx/buildbot/slave/package_osx64/build/src/subtype.c:638
subtype_unionall at /Users/osx/buildbot/slave/package_osx64/build/src/subtype.c:638
exists_subtype at /Users/osx/buildbot/slave/package_osx64/build/src/subtype.c:1078 [inlined]
...

and I get something similar when taking a sample during execution. It happens consistently in the mexico dataset from the Zelig group, but not if I load the datasets one by one outside of a loop. However, just running this loop seems to be sufficient to trigger it: https://github.com/johnmyleswhite/RDatasets.jl/blob/482317f99aa76b4cbc1f2df4ee0068d25c032ae1/test/dataset.jl#L8-L20
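
For anyone who wants to reproduce the per-dataset numbers without the full test script, here is a minimal sketch of a timing wrapper. It is not the code from test/dataset.jl: `timed_load` is a hypothetical helper, and the `poolalloc` and `malloc` fields it reads belong to the internal `Base.GC_Diff` struct, which is assumed to stay available across Julia versions.

```julia
# Hypothetical helper (not part of RDatasets): time one dataset load and log
# the allocation counters in a format similar to the log above.
# `loader` is any function (pkg, name) -> table, for example RDatasets.dataset.
function timed_load(loader, pkg::AbstractString, name::AbstractString)
    # Positional destructuring works whether @timed returns a plain tuple
    # (Julia 1.0) or a NamedTuple (later versions).
    value, seconds, bytes, gctime, gcstats = @timed loader(pkg, name)

    # gcstats is a Base.GC_Diff; poolalloc and malloc are internal fields.
    msg = string(pkg, "::", name, ": loaded in ", round(seconds, digits = 3),
                 " s, ", bytes, " bytes allocated, ", gcstats.poolalloc,
                 " pool allocs, ", gcstats.malloc, " malloc() calls")
    @info msg

    return (value = value, seconds = seconds, bytes = bytes)
end

# Stand-in loader so the sketch runs without any package installed.
toy_loader(pkg, name) = collect(1:10)
res = timed_load(toy_loader, "Demo", "toy")
```

With RDatasets installed, passing `dataset` as the loader and iterating over the `Package` and `Dataset` columns of `RDatasets.datasets()` should give a loop comparable to the linked one.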

JeffBezanson commented 6 years ago
T = Tuple{NTuple{33,Union{Missing, Int64}}}

T <: Tuple{T} where T<:(Tuple{Vararg{E, N} where N} where E)
# fine

T <: (Tuple{T} where T<:Tuple{Vararg{E, N} where N}) where E
# takes "a while" :)

I'll work on it. I think the problem is that we have to wait until we leave the UnionAll for E to see that its constraints can't be met, when we could potentially see that earlier.
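
A scaled-down sketch for watching how the two forms behave as the tuple grows, assuming short tuples are enough to show the trend. The names `U`, `INNER_E`, `OUTER_E` and `time_subtype` are made up for illustration, the type variables are renamed to `S` and `E` to avoid shadowing, and `Vararg{E}` is used in place of `Vararg{E, N} where N`, which is assumed to be equivalent here.

```julia
# Element type taken from the snippet above.
const U = Union{Missing, Int64}

# E bound inside the upper bound of S (the "fine" form).
const INNER_E = (Tuple{S} where S<:(Tuple{Vararg{E}} where E))

# E bound outside the whole type (the form that took "a while").
const OUTER_E = ((Tuple{S} where S<:Tuple{Vararg{E}}) where E)

# Time a single subtype query for a tuple of length n.
function time_subtype(n::Integer, sup)
    sub = Tuple{NTuple{n, U}}
    result, seconds = @timed sub <: sup
    return (n = n, result = result, seconds = seconds)
end

# Keep n small: on an affected Julia version the cost of the second form may
# grow very quickly with n.
timings = [(time_subtype(n, INNER_E), time_subtype(n, OUTER_E)) for n in 1:8]
for (a, b) in timings
    println("n = ", a.n, ": inner E ", a.seconds, " s, outer E ", b.seconds, " s")
end
```

On a Julia version that contains the fix, both columns should stay small, so this is mainly useful for comparing versions.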

andreasnoack commented 6 years ago

I suspect that this might be the same as reported in https://github.com/JuliaData/CSV.jl/issues/236

randyzwitch commented 6 years ago

This memory leak aside, is there any other reason not to merge this PR? The leak doesn't appear to be related to the PR itself, and it can be explored elsewhere.

alyst commented 6 years ago

@randyzwitch Thanks for the reminder! Now it passes on Julia nightly, so I think it's safe to merge (cc @andreasnoack)

randyzwitch commented 6 years ago

@alyst I'm not sure passing only on nightly is a reasonable bar for this package, since 0.7 and 1.0 are the original goal

alyst commented 6 years ago

@randyzwitch CI doesn't pass on 0.7 and 1.0 because of (at least) JuliaLang/julia#28677; RDatasets (and CSV) themselves are fine. I think it's fine to release it, because IRL you would not try to load all CSV datasets at once, and loading individual datasets should be OK. Also, I would have postponed it had there already been a version of RDatasets that doesn't trigger the Julia bug, but ATM there are no stable versions that run on v0.7/v1.0. Alternatively, we can wait until 1.0.1.

randyzwitch commented 6 years ago

Thanks for the explanation @alyst. Can you add this note to the README, so that incoming users understand that the package is expected to work but depends on a fix in master? Then I'll merge it in.

alyst commented 6 years ago

@randyzwitch Good idea, I've added a note to README.

randyzwitch commented 6 years ago

Merged

alyst commented 6 years ago

@randyzwitch Could you please also tag a new minor release?

andreasnoack commented 6 years ago

It looks like Attobot isn't enabled so I've opened https://github.com/JuliaLang/METADATA.jl/pull/17366