ldbc-dev / ldbc_snb_datagen_deprecated2015

LDBC-SNB Data Generator
GNU General Public License v3.0

duplicate universities #12

Open alexaverbuch opened 10 years ago

alexaverbuch commented 10 years ago

params.ini:

numPersons:1000
numYears:1
startYear:2010
compressed:false
serializer:csv
numThreads:1
updateStreams:false
outputDir:/Users/alexaverbuch/hadoopTempDir/output/

grep output (the same university appears seven times, each with a different id):

alexaverbuch$ grep -in 'Aga_Khan_University' organisation_0.csv 
1581:1579|university|Aga_Khan_University|http://dbpedia.org/resource/Aga_Khan_University|
2546:2544|university|Aga_Khan_University|http://dbpedia.org/resource/Aga_Khan_University|
4079:4077|university|Aga_Khan_University|http://dbpedia.org/resource/Aga_Khan_University|
4970:4968|university|Aga_Khan_University|http://dbpedia.org/resource/Aga_Khan_University|
5441:5439|university|Aga_Khan_University|http://dbpedia.org/resource/Aga_Khan_University|
6517:6515|university|Aga_Khan_University|http://dbpedia.org/resource/Aga_Khan_University|
6796:6794|university|Aga_Khan_University|http://dbpedia.org/resource/Aga_Khan_University|
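
To check whether other organisations are affected as well, something like the following could be used. It is only a minimal sketch, not part of the generator: the function name `find_duplicate_organisations` is hypothetical, and the column order (id, type, name, url, separated by `|`) is assumed from the grep output above.

```python
import csv
from collections import defaultdict


def find_duplicate_organisations(lines):
    """Return {(type, name): [ids]} for organisations listed under more than one id.

    `lines` is any iterable of pipe-delimited rows, e.g. an open file handle.
    Assumed column order (inferred from the grep output): id|type|name|url|
    """
    ids_by_key = defaultdict(list)
    for row in csv.reader(lines, delimiter="|"):
        # Skip blank or malformed rows that lack the id, type and name columns.
        if len(row) < 3:
            continue
        org_id, org_type, name = row[0], row[1], row[2]
        ids_by_key[(org_type, name)].append(org_id)
    # Keep only the (type, name) pairs that occur under several ids.
    return {key: ids for key, ids in ids_by_key.items() if len(ids) > 1}
```

Passing the open `organisation_0.csv` file handle to this function would return one entry per duplicated (type, name) pair together with all ids it was written under. A header row, if the file has one, occurs only once and is therefore never reported.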