smondet closed this 6 years ago
@smondet thanks for reporting those. But remember, you should not add the indexes before loading the tables; the ETL can take far longer that way. I suggest you remove them. (Updating indexes during the load is much more expensive than building them from scratch afterwards.)
Do you mean that we should not build the indexes with `psql "$OMOP" -f "omop/build-omop/postgresql/OMOP CDM postgresql indexes.txt"` until after running the ETL with `psql "$MIMIC" -f "etl/etl.sql"`? I'm just trying to clarify because the order implied by the README suggests that we should build the indexes first.
I probably wrote that, and doing it in the order I suggested is probably a lot slower than it needs to be!
Yes, exactly: indexes and integrity constraints should be added after the ETL step.
Let me modify the README accordingly.
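For reference, here is a rough sketch of that ordering, using only the two commands quoted earlier in this thread. The `run` helper, the `load_then_index` function name, and the `DRY_RUN` switch are made up for illustration and are not part of the repository; `$MIMIC` and `$OMOP` are assumed to be the same connection strings as above.

```shell
#!/bin/sh
# Sketch only: run the ETL first, build the indexes afterwards.
# `run`, `load_then_index` and DRY_RUN are hypothetical helpers, not repo scripts.

run() {
    # With DRY_RUN=1, print the command instead of executing it.
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

load_then_index() {
    # 1. Load the data while the target tables have no indexes yet.
    run psql "$MIMIC" -f "etl/etl.sql" || return 1
    # 2. Build the indexes once, over the fully loaded tables.
    run psql "$OMOP" -f "omop/build-omop/postgresql/OMOP CDM postgresql indexes.txt" || return 1
    # Integrity constraints would follow here as well; the script for them
    # is not named in this thread, so it is left out of the sketch.
}
```

Calling `DRY_RUN=1 load_then_index` should just print the two `psql` commands in the order they would run, which is a cheap way to check the ordering before starting a long load.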
Well, @alistairewj looks ready to update his great documentation.
Awesome, that's good to know. I have also been having similar issues with duplicate keys, but I'll try re-running with the indexes at the end.
Yeah, no problem - I think I already mention this for the constraints, but I've now added it for the indexes too. The constraints are clearly useful though - we wouldn't have found these bugs without them :)
After trying the fix for #48, I got: