Btibert3 / pypeds

Python package to facilitate and standardize enrollment science within higher education and Enrollment Management
https://www.linkedin.com/pulse/introducing-pypeds-brock-tibert/
GNU General Public License v3.0
9 stars 6 forks source link

One time extract error? #6

Open Btibert3 opened 4 years ago

Btibert3 commented 4 years ago

I am getting the following error from time to time

>>> years = list(range(2002, 2019))
>>> 
>>> # instantiate and go
>>> tmp = ipeds.ICAY(years=years)
>>> tmp.extract()
sys:1: DtypeWarning: Columns (177,186,195) have mixed types. Specify dtype option on import or set low_memory=False.
>>> tmp.extract()
>>> x = tmp.load()
>>> x.shape
(156330, 297)
>>> x.fall_year.value_counts(dropna=False, sort=False)
2002.0    9412
2003.0    9460
2004.0    9262
2005.0    9136
2006.0    9146
2007.0    9126
2008.0    9206
2009.0    9322
2010.0    9356
2011.0    9586
2012.0    9612
2013.0    9566
2014.0    9320
2015.0    9214
2016.0    8794
2017.0    8562
2018.0    8250
Name: fall_year, dtype: int64

but this feels random, such that I can re-run extract and it works. Perhaps its my coding environment?

Consider low memory option as a param and/or emphasize tools like Google Collab or Kaggle.