iangow / se_features

Linguistic features derived from StreetEvents
1 stars 3 forks source link

Run NER_class_alt_4 #24

Closed Yvonne-Han closed 4 years ago

iangow commented 4 years ago

@Yvonne-Han Yes, I think it's a good idea to run this for completeness. Maybe over a weekend.

Yvonne-Han commented 4 years ago

@Yvonne-Han Yes, I think it's a good idea to run this for completeness. Maybe over a weekend.

Sure. Last time I tracked the total time and it took ~70 hours to run ner_class_alt_7, so I will probably start on a Friday evening.

Yvonne-Han commented 4 years ago

@iangow I ran the code to create ner_class_alt_4 just now (23:37:35 24/04/2020 AEST).

iangow commented 4 years ago

You mean 2020-04-24 23:37:35 AEST?

Source: XKCD

Yvonne-Han commented 4 years ago

You mean 2020-04-24 23:37:35 AEST?

Source: XKCD

Next time I will! I thought you'd prefer the Australian way šŸ˜¢

If you are not that busy today, could you please take a look at the table here? I did a summary of my findings and would like to see whether you want to fix the mislabelling of pres and qa.

Yvonne-Han commented 4 years ago

@iangow ner_class_alt_4 is now successfully created (2020-04-27 16:00:51 AEST), and below is a preview of the table:

library(dplyr, warn.conflicts = FALSE)
library(DBI)
library(reprex)

pg <- dbConnect(RPostgres::Postgres())
rs <- dbExecute(pg, "SET search_path TO se_features")

ner_class_alt_4 <- tbl(pg, "ner_class_alt_4")
ner_class_alt_4
#> # Source:   table<ner_class_alt_4> [?? x 6]
#> # Database: postgres [yanzih1@10.101.13.99:5432/crsp]
#>    file_name last_update         speaker_number section context ner_tags   
#>    <chr>     <dttm>                       <int>   <int> <chr>   <chr>      
#>  1 3668737_T 2011-01-14 04:57:11              5       1 pres    {}         
#>  2 3668737_T 2011-01-14 04:57:11              4       1 pres    "{\"PERSONā€¦
#>  3 3668737_T 2011-01-14 04:57:11              3       1 pres    "{\"MISC\"ā€¦
#>  4 3668737_T 2011-01-14 04:57:11              2       1 pres    "{\"PERSONā€¦
#>  5 3668737_T 2011-01-14 04:57:11              1       1 pres    "{\"PERSONā€¦
#>  6 2516454_T 2009-10-26 22:49:41             39       1 qa      {}         
#>  7 2516454_T 2009-10-26 22:49:41             38       1 qa      {}         
#>  8 2516454_T 2009-10-26 22:49:41             37       1 qa      {}         
#>  9 2516454_T 2009-10-26 22:49:41             36       1 qa      "{\"PERSONā€¦
#> 10 2516454_T 2009-10-26 22:49:41             35       1 qa      "{\"PERSONā€¦
#> # ā€¦ with more rows

Created on 2020-04-27 by the reprex package (v0.3.0)

I'm closing this issue now.