Coleridge-Initiative / RCDatasets

Creative Commons Zero v1.0 Universal
3 stars 2 forks source link

Are these datasets duplicates? #116

Closed ceteri closed 4 years ago

ceteri commented 4 years ago

dataset-584 and dataset-068 from National Student Clearinghouse Research Center appear to be duplicates?

  {'description': 'Thousands of high schools currently use StudentTracker® reports from the Research Center to measure how many of their graduates go on to college, where they attend, and how many persist to graduation. The reports were designed to help schools to measure their success in preparing students for college, and to evaluate the effectiveness of college access programs and curricula.', 'id': 'dataset-068', 'provider': 'provider-163', 'title': 'College Enrollment', 'url': 'https://nscresearchcenter.org/workingwithourdata/'}

  {'description': 'The National Student Clearinghouse regularly publishes research on student enrollment, movement, and other important student outcomes using student-level data provided exclusively to the National Student Clearinghouse by our nationwide network of postsecondary institutions. These reports are to benefit and better inform the educational community, policymakers, community leaders, and others.', 'id': 'dataset-584', 'provider': 'provider-163', 'title': 'National Student Clearinghouse data', 'url': 'https://nscresearchcenter.org/workingwithourdata/'}

If they are duplicates, please go back through the partitions and substitute instances of dataset-584 with dataset-068, then delete dataset-584.

ceteri commented 4 years ago

Also, are these duplicates?