Open kalebm1 opened 3 years ago
I'm having exactly the same problem, and I'm also sending the SessionID in cookies if anyone say it might be the problem... Still trying to understand what could be causing this issue
I have the same issue when I search using proxy and sessionid. I think the problem is defining the sessionid, that's why missing data is coming. And the library gives error but I couldn't find how to solve it.
Describe the bug I am trying to scrape posts from a hashtag. I am have used the both the Selenium driver and headers with a sessionid way of getting around the Instagram redirect to login page error. Before Instagram was redirecting to the login page, I was able to successfully scrape the hashtag with no problem. Once the redirection occurred, I inputted my sessionid into the headers field and got the following error:
post_arr = self.json_dict["entry_data"]["TagPage"][0]["graphql"]["hashtag"]["edge_hashtag_to_media"]["edges"] KeyError: 'graphql'
. I am fairly new to the library, so I decided to poke around in the code a bit and read through similar issues. After poking around, I think this error is similar to #124 in the sense that thejson_dict
s are not structured the same. I printed thejson_dict
out to a file and found that there is no graphql available nor are there many of the other things that theget_recent_posts
looks for. I hope the fix for this error is as simple as the other issue.To Reproduce Steps to reproduce the behavior:
Expected behavior The expected outcome is a List[Posts] as what should typically be returned when calling the
hashtag.get_recent_posts()
method.Screenshots
Desktop (please complete the following information):