behavioral-ds / evently

evently: simulation, fitting of Hawkes processes
https://www.behavioral-ds.science/evently
MIT License
16 stars 2 forks source link

Error in .checkTypos(e, names_x) : Object 'retweet_id' not found amongst #17

Open monanasery opened 1 year ago

monanasery commented 1 year ago

Hi, Sorry to open a new issue.

Your code works fine with your sample data "tweets_anonymized.jsonl", but when I run your code (the following line): cascades <- parse_raw_tweets_to_cascades(filepath, batch = 10000, output_path = "data", progress = T, api_version=1) , I got this error: Error in .checkTypos(e, names_x) : Object 'retweet_id' not found amongst

My data is also jsonl tweet object (v1 api), and the structure looks similar to your data. Here is the first line of my data (the remaining lines have the same structure):

{"created_at": "Fri Mar 13 02:23:25 +0000 2020", "id": 1238289533789261828, "id_str": "1238289533789261828", "full_text": "RT @warriors_mom: Remember the Obama H1N1 NCAA shutdown? Me neither! \ud83e\udd26\ud83c\udffb\u200d\u2640\ufe0f\n\n\u201cMarch Madness is CANCELLED: NCAA calls off all major spring co\u2026", "truncated": false, "display_text_range": [0, 140], "entities": {"hashtags": [], "symbols": [], "user_mentions": [{"screen_name": "warriors_mom", "name": "CyberChick", "id": 75184478, "id_str": "75184478", "indices": [3, 16]}], "urls": []}, "source": "<a href=\"https://mobile.twitter.com\" rel=\"nofollow\">Twitter Web App</a>", "in_reply_to_status_id": null, "in_reply_to_status_id_str": null, "in_reply_to_user_id": null, "in_reply_to_user_id_str": null, "in_reply_to_screen_name": null, "user": {"id": 2596787408, "id_str": "2596787408", "name": "Mike Packer", "screen_name": "realmikepacker", "location": "South Carolina, USA", "description": "Grandpa, dog lover, audiophile, golfer, Happy resident of the great state of South Carolina! Clemson #ALLIN #2A Conservative, GSD's, \nNO CRYPTO", "url": null, "entities": {"description": {"urls": []}}, "protected": false, "followers_count": 13517, "friends_count": 14046, "listed_count": 5, "created_at": "Mon Jun 30 19:11:42 +0000 2014", "favourites_count": 388121, "utc_offset": null, "time_zone": null, "geo_enabled": true, "verified": false, "statuses_count": 116999, "lang": null, "contributors_enabled": false, "is_translator": false, "is_translation_enabled": false, "profile_background_color": "000000", "profile_background_image_url": "http://abs.twimg.com/images/themes/theme1/bg.png", "profile_background_image_url_https": "https://abs.twimg.com/images/themes/theme1/bg.png", "profile_background_tile": false, "profile_image_url": "http://pbs.twimg.com/profile_images/846018120535830528/BeBLT4C2_normal.jpg", "profile_image_url_https": "https://pbs.twimg.com/profile_images/846018120535830528/BeBLT4C2_normal.jpg", "profile_banner_url": "https://pbs.twimg.com/profile_banners/2596787408/1471009347", "profile_image_extensions_alt_text": null, "profile_banner_extensions_alt_text": null, "profile_link_color": "0084B4", "profile_sidebar_border_color": "000000", "profile_sidebar_fill_color": "000000", "profile_text_color": "000000", "profile_use_background_image": false, "has_extended_profile": true, "default_profile": false, "default_profile_image": false, "following": false, "follow_request_sent": false, "notifications": false, "translator_type": "none", "withheld_in_countries": []}, "geo": null, "coordinates": null, "place": null, "contributors": null, "retweeted_status": {"created_at": "Fri Mar 13 02:17:22 +0000 2020", "id": 1238288010380271616, "id_str": "1238288010380271616", "full_text": "Remember the Obama H1N1 NCAA shutdown? Me neither! \ud83e\udd26\ud83c\udffb\u200d\u2640\ufe0f\n\n\u201cMarch Madness is CANCELLED: NCAA calls off all major spring competitions - including the $1B men's &amp; women's basketball tournaments - over coronavirus fears.\u201d #MarchSadness #Disinformation https://t.co/8n77sD1vYW", "truncated": false, "display_text_range": [0, 275], "entities": {"hashtags": [{"text": "MarchSadness", "indices": [222, 235]}, {"text": "Disinformation", "indices": [236, 251]}], "symbols": [], "user_mentions": [], "urls": [{"url": "https://t.co/8n77sD1vYW", "expanded_url": "https://www.dailymail.co.uk/news/article-8105591/Big-Ten-ACC-SEC-Big-12-CANCEL-conference-basketball-tournaments.html", "display_url": "dailymail.co.uk/news/article-8\u2026", "indices": [252, 275]}]}, "source": "<a href=\"http://twitter.com/download/iphone\" rel=\"nofollow\">Twitter for iPhone</a>", "in_reply_to_status_id": null, "in_reply_to_status_id_str": null, "in_reply_to_user_id": null, "in_reply_to_user_id_str": null, "in_reply_to_screen_name": null, "user": {"id": 75184478, "id_str": "75184478", "name": "CyberChick", "screen_name": "warriors_mom", "location": "MN lakes to TN mountains", "description": "#MilitaryFamily #CyberSecurity Personal IMHO acct here. I ask tough questions. Biracial (Sioux & Swede). #TheMighty200 #1A MN Refugee #CyberTheologian", "url": "https://t.co/891J0c6Owk", "entities": {"url": {"urls": [{"url": "https://t.co/891J0c6Owk", "expanded_url": "http://www.ReconComputing.com", "display_url": "ReconComputing.com", "indices": [0, 23]}]}, "description": {"urls": []}}, "protected": false, "followers_count": 9629, "friends_count": 10448, "listed_count": 586, "created_at": "Fri Sep 18 02:58:17 +0000 2009", "favourites_count": 102447, "utc_offset": null, "time_zone": null, "geo_enabled": false, "verified": false, "statuses_count": 335839, "lang": null, "contributors_enabled": false, "is_translator": false, "is_translation_enabled": false, "profile_background_color": "C0DEED", "profile_background_image_url": "http://abs.twimg.com/images/themes/theme1/bg.png", "profile_background_image_url_https": "https://abs.twimg.com/images/themes/theme1/bg.png", "profile_background_tile": false, "profile_image_url": "http://pbs.twimg.com/profile_images/554705065047388160/15Pf51AV_normal.jpeg", "profile_image_url_https": "https://pbs.twimg.com/profile_images/554705065047388160/15Pf51AV_normal.jpeg", "profile_banner_url": "https://pbs.twimg.com/profile_banners/75184478/1398269871", "profile_image_extensions_alt_text": null, "profile_banner_extensions_alt_text": null, "profile_link_color": "1DA1F2", "profile_sidebar_border_color": "C0DEED", "profile_sidebar_fill_color": "DDEEF6", "profile_text_color": "333333", "profile_use_background_image": true, "has_extended_profile": false, "default_profile": true, "default_profile_image": false, "following": false, "follow_request_sent": false, "notifications": false, "translator_type": "none", "withheld_in_countries": []}, "geo": null, "coordinates": null, "place": null, "contributors": null, "is_quote_status": false, "retweet_count": 2, "favorite_count": 3, "favorited": false, "retweeted": false, "possibly_sensitive": false, "lang": "en"}, "is_quote_status": false, "retweet_count": 2, "favorite_count": 0, "favorited": false, "retweeted": false, "lang": "en"}

I really appreciate if you can help me with this. Thank you!