Closed greut closed 12 years ago
I think what's best is to go with bulk loader (LOAD DATA …) because it's blazzing fast and easy. The first step is to create clean CSV files.
LOAD DATA …
What I don't know yet is how much the data must be sanitized, like DePaul and DePaul University in drafts.csv.
DePaul
DePaul University
drafts.csv
(Note: the dataset is pretty small anyway so speed is not really an issue here)
I wouldn't even bother normalizing universities, it's just a text column , not a unique id = no biggie.
I'm pretty sure they will be pita's about NF's.
We're done for now ;-)
I think what's best is to go with bulk loader (
LOAD DATA …
) because it's blazzing fast and easy. The first step is to create clean CSV files.What I don't know yet is how much the data must be sanitized, like
DePaul
andDePaul University
indrafts.csv
.