Closed danzhechen closed 7 months ago
In load_tweets_batch.py, you have not replaced all instances of id_urls with url. To do that I would suggest pressing ESC and then typing /id_urls.
In this case I think you have not replaced id_urls in users table in load_tweets_batch.py with url
Hi Abizer @abizermamnoon ,
I tried your method, and I think I did make all the changes for id_urls. I tried to bring the docker up and down, cleaned the volume, but it still had the same issue.
Hi @danzhechen, please let me know if you find a solution to this error, as I am encountering the same thing :(
@danzhechen @mmendiratta27 I was getting this same error from a missed change in the load_tweets_batch.py file. I would definitely recommend going over that file as well as the schema.sql file as a small missed edit or typo could be causing that issue.
One possible (and common) cause of errors like this that hasn't been mentioned yet is that you've modified your schema.sql
file, but you haven't deleted the volume and rebuilt the image properly. I would also double check those two steps.
Thanks for everyone's help. I think I still have the same issue on lambda server but I somehow passed on the GitHub. I checked everything, I stopped all the dockers, I deleted the volume and rebuilt the image. I do not why, I think I will come to office hour for that issue.
I have the same experience as @danzhechen. How should we fill out the time section @mikeizbicki? I have run times for everything except pg_normalized_batch
for parallel running.
@mmendiratta27 I'm fairly confident this error is due to not correctly rebuilding your image. (It works on github because there is nothing to rebuild.) In order to record the timings, you will have to get it working on the lambda server. And if it's not working, you won't get credit for that part. I would be happy to help you figure out the problem after class/in office hours tomorrow.
Thanks to the help of @abizermamnoon. And Mike is right. I think my problem should be not delete my images. Be sure to delete your images, delete the volume and build the whole things up again. I have a strange issue saying conflict: unable to delete. I forced the image to be deletes and it works now.
i ran docker volume ls
, then got all volume names and ran docker volume rm volume_name1 volume_name2 volume_name3....
for all volumes, then randocker-compose down
, docker-compose up -d
, and ./load_tweets_parallel.sh
. that cleaned everything for me.
Hi there,
I am working on the part of Normalized Data (batched). I did finish editing all the requirements and I even deleted all the FOREIGN KEYS. But when I run this command to test:
I already assigned the value to $files in my environment. I still keep getting two error messages.
I think the bigger problem is this one:
Anyone has an idea of what might cause the problem?