RobertMyles / tidyRSS

An R package for extracting 'tidy' data frames from RSS, Atom and JSON feeds
https://robertmyles.github.io/tidyRSS/
Other
82 stars 20 forks source link

entry_category & item_category errors #44

Closed RobertMyles closed 4 years ago

RobertMyles commented 4 years ago

Error: Columnitem_categorymust be length 15 (the number of rows) or one, not 7 Error: Columnentry_categorymust be length 25 (the number of rows) or one, not 19

Seen with:

RobertMyles commented 4 years ago

Fixed:

> tidyfeed("http://d.hatena.ne.jp/bob3/rss")
GET request successful. Parsing...

# A tibble: 30 x 15
   feed_title feed_url feed_last_updated   feed_author feed_link feed_generator entry_title entry_url entry_last_updated  entry_author
 * <chr>      <chr>    <dttm>              <chr>       <chr>     <chr>          <chr>       <chr>     <dttm>              <chr>       
 1 bob3’s bl… hatenab… 2020-01-14 09:51:06 bob3        http://b… Hatena::Blog   R に Intel … hatenabl… 2020-01-14 17:56:40 bob3        
 2 bob3’s bl… hatenab… 2020-01-14 09:51:06 bob3        http://b… Hatena::Blog   broomパッケージ… hatenabl… 2019-08-18 22:10:09 bob3        
 3 bob3’s bl… hatenab… 2020-01-14 09:51:06 bob3        http://b… Hatena::Blog   続々・複数のファイル… hatenabl… 2019-07-09 22:21:29 bob3        
 4 bob3’s bl… hatenab… 2020-01-14 09:51:06 bob3        http://b… Hatena::Blog   続・複数のファイルを… hatenabl… 2019-07-09 00:12:29 bob3        
 5 bob3’s bl… hatenab… 2020-01-14 09:51:06 bob3        http://b… Hatena::Blog   base と Tid… hatenabl… 2019-07-07 21:35:14 bob3        
 6 bob3’s bl… hatenab… 2020-01-14 09:51:06 bob3        http://b… Hatena::Blog   複数のファイルを一度… hatenabl… 2019-07-07 14:48:10 bob3        
 7 bob3’s bl… hatenab… 2020-01-14 09:51:06 bob3        http://b… Hatena::Blog   R本体のアップデート… hatenabl… 2019-07-06 23:03:56 bob3        
 8 bob3’s bl… hatenab… 2020-01-14 09:51:06 bob3        http://b… Hatena::Blog   Rを使ってExcel… hatenabl… 2019-06-04 23:42:37 bob3        
 9 bob3’s bl… hatenab… 2020-01-14 09:51:06 bob3        http://b… Hatena::Blog   vroomパッケージ… hatenabl… 2019-05-18 12:35:22 bob3        
10 bob3’s bl… hatenab… 2020-01-14 09:51:06 bob3        http://b… Hatena::Blog   lavaanで作った… hatenabl… 2019-03-02 14:31:16 bob3        
# … with 20 more rows, and 5 more variables: entry_content <chr>, entry_link <chr>, entry_summary <chr>, entry_category <list>,
#   entry_published <dttm>
> tidyfeed("http://allthiswasfield.blogspot.com/feeds/posts/default?alt=rss")
GET request successful. Parsing...

# A tibble: 16 x 15
   feed_title feed_link feed_description feed_managing_e… feed_pub_date       feed_last_build_da… feed_category feed_generator
 * <chr>      <chr>     <chr>            <chr>            <dttm>              <dttm>              <chr>         <chr>         
 1 long time… http://a… Ecological mode… noreply@blogger… 2020-04-03 23:16:00 2020-04-03 23:16:07 R             Blogger       
 2 long time… http://a… Ecological mode… noreply@blogger… 2020-04-03 23:16:00 2020-04-03 23:16:07 R             Blogger       
 3 long time… http://a… Ecological mode… noreply@blogger… 2020-04-03 23:16:00 2020-04-03 23:16:07 R             Blogger       
 4 long time… http://a… Ecological mode… noreply@blogger… 2020-04-03 23:16:00 2020-04-03 23:16:07 R             Blogger       
 5 long time… http://a… Ecological mode… noreply@blogger… 2020-04-03 23:16:00 2020-04-03 23:16:07 R             Blogger       
 6 long time… http://a… Ecological mode… noreply@blogger… 2020-04-03 23:16:00 2020-04-03 23:16:07 R             Blogger       
 7 long time… http://a… Ecological mode… noreply@blogger… 2020-04-03 23:16:00 2020-04-03 23:16:07 R             Blogger       
 8 long time… http://a… Ecological mode… noreply@blogger… 2020-04-03 23:16:00 2020-04-03 23:16:07 R             Blogger       
 9 long time… http://a… Ecological mode… noreply@blogger… 2020-04-03 23:16:00 2020-04-03 23:16:07 R             Blogger       
10 long time… http://a… Ecological mode… noreply@blogger… 2020-04-03 23:16:00 2020-04-03 23:16:07 R             Blogger       
11 long time… http://a… Ecological mode… noreply@blogger… 2020-04-03 23:16:00 2020-04-03 23:16:07 R             Blogger       
12 long time… http://a… Ecological mode… noreply@blogger… 2020-04-03 23:16:00 2020-04-03 23:16:07 R             Blogger       
13 long time… http://a… Ecological mode… noreply@blogger… 2020-04-03 23:16:00 2020-04-03 23:16:07 R             Blogger       
14 long time… http://a… Ecological mode… noreply@blogger… 2020-04-03 23:16:00 2020-04-03 23:16:07 R             Blogger       
15 long time… http://a… Ecological mode… noreply@blogger… 2020-04-03 23:16:00 2020-04-03 23:16:07 R             Blogger       
16 long time… http://a… Ecological mode… noreply@blogger… 2020-04-03 23:16:00 2020-04-03 23:16:07 R             Blogger       
# … with 7 more variables: item_title <chr>, item_link <chr>, item_description <chr>, item_pub_date <dttm>, item_guid <chr>,
#   item_author <chr>, item_category <list>
> tidyfeed("http://allthingsdatascience.blogspot.com/feeds/posts/default")
GET request successful. Parsing...

# A tibble: 25 x 14
   feed_title feed_url feed_last_updated   feed_author feed_link feed_generator entry_title entry_url entry_last_updated  entry_author
 * <chr>      <chr>    <dttm>              <chr>       <chr>     <chr>          <chr>       <chr>     <dttm>              <chr>       
 1 All Thing… tag:blo… 2020-03-15 07:50:02 Istvan Haj… http://a… Blogger        "Data Scie… tag:blog… 2020-01-25 01:41:34 Istvan Hajn…
 2 All Thing… tag:blo… 2020-03-15 07:50:02 Istvan Haj… http://a… Blogger        "Besprekin… tag:blog… 2018-05-10 15:20:30 Istvan Hajn…
 3 All Thing… tag:blo… 2020-03-15 07:50:02 Istvan Haj… http://a… Blogger        "(small) s… tag:blog… 2016-12-01 09:06:54 Istvan Hajn…
 4 All Thing… tag:blo… 2020-03-15 07:50:02 Istvan Haj… http://a… Blogger        "Market Re… tag:blog… 2015-02-22 05:40:46 Istvan Hajn…
 5 All Thing… tag:blo… 2020-03-15 07:50:02 Istvan Haj… http://a… Blogger        "Wat een L… tag:blog… 2017-12-27 01:16:29 Istvan Hajn…
 6 All Thing… tag:blo… 2020-03-15 07:50:02 Istvan Haj… http://a… Blogger        "So what's… tag:blog… 2014-06-22 01:14:18 Istvan Hajn…
 7 All Thing… tag:blo… 2020-03-15 07:50:02 Istvan Haj… http://a… Blogger        "Hebben 'v… tag:blog… 2014-05-12 03:33:15 Istvan Hajn…
 8 All Thing… tag:blo… 2020-03-15 07:50:02 Istvan Haj… http://a… Blogger        "Hoe moord… tag:blog… 2014-04-19 01:38:50 Istvan Hajn…
 9 All Thing… tag:blo… 2020-03-15 07:50:02 Istvan Haj… http://a… Blogger        "Over Lamp… tag:blog… 2013-10-26 02:14:58 Istvan Hajn…
10 All Thing… tag:blo… 2020-03-15 07:50:02 Istvan Haj… http://a… Blogger        "Managing … tag:blog… 2013-10-23 08:19:07 Istvan Hajn…
# … with 15 more rows, and 4 more variables: entry_content <chr>, entry_link <chr>, entry_category <list>, entry_published <dttm>
>