JohnSnowLabs / nlu

1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.
Apache License 2.0
854 stars 130 forks source link

48 new Transformer based models in 9 new languages, including NER for Finance, Industry, Politcal Policies, COVID and Chemical Trials, various clinical and medical domains in Spanish and English and much more in NLU 3.3.1 #88

Closed C-K-Loan closed 2 years ago

C-K-Loan commented 2 years ago

We are incredibly excited to announce NLU 3.3.1 has been released with 48 new models in 9 languages!

It comes with 2 new types of state-of-the-art models,distilBERT and BERT for sequence classification with various pretrained weights, state-of-the-art bert based classifiers for problems in the domains of Finance, Sentiment Classification, Industry, News and much more.

One the healthcare side, NLU features 22 new models in for English and Spanish with withEntity Resolver Models for LOINC, MeSH, NDC and SNOMED and UMLS Diseases, NER models for Biomarkers, NIHSS-Guidelines, COVID Trials , Chemical Trials, Bert based Token Classifier models for biological, genetical,cancer, cellular terms, Bert for Sequence Classification models for clinical question vs statement classification and finally Spanich Clinical NER and Resolver Models

Once again, we would like to thank our community for making another amazing release possible!

New Open Sourcen Models and Features

Integrates the amazing Spark NLP 3.3.3 and 3.3.2 releases, featuring:

Complete List of Open Source Models :

Language NLU Reference Spark NLP Reference Task
en en.classify.bert_sequence.imdb_large bert_large_sequence_classifier_imdb Text Classification
en en.classify.bert_sequence.imdb bert_base_sequence_classifier_imdb Text Classification
en en.classify.bert_sequence.ag_news bert_base_sequence_classifier_ag_news Text Classification
en en.classify.bert_sequence.dbpedia_14 bert_base_sequence_classifier_dbpedia_14 Text Classification
en en.classify.bert_sequence.finbert bert_sequence_classifier_finbert Text Classification
en en.classify.bert_sequence.dehatebert_mono bert_sequence_classifier_dehatebert_mono Text Classification
tr tr.classify.bert_sequence.sentiment bert_sequence_classifier_turkish_sentiment Text Classification
de de.classify.bert_sequence.sentiment bert_sequence_classifier_sentiment Text Classification
ru ru.classify.bert_sequence.sentiment bert_sequence_classifier_rubert_sentiment Text Classification
ja ja.classify.bert_sequence.sentiment bert_sequence_classifier_japanese_sentiment Text Classification
es es.classify.bert_sequence.sentiment bert_sequence_classifier_beto_sentiment_analysis Text Classification
es es.classify.bert_sequence.emotion bert_sequence_classifier_beto_emotion_analysis Text Classification
xx xx.classify.bert_sequence.sentiment bert_sequence_classifier_multilingual_sentiment Text Classification
en en.classify.distilbert_sequence.sst2 distilbert_sequence_classifier_sst2 Text Classification
en en.classify.distilbert_sequence.policy distilbert_sequence_classifier_policy Text Classification
en en.classify.distilbert_sequence.industry distilbert_sequence_classifier_industry Text Classification
en en.classify.distilbert_sequence.emotion distilbert_sequence_classifier_emotion Text Classification
en en.classify.distilbert_sequence.banking77 distilbert_sequence_classifier_banking77 Text Classification
en en.classify.distilbert_sequence.imdb distilbert_base_sequence_classifier_imdb Text Classification
en en.classify.distilbert_sequence.amazon_polarity distilbert_base_sequence_classifier_amazon_polarity Text Classification
en en.classify.distilbert_sequence.ag_news distilbert_base_sequence_classifier_ag_news Text Classification
fr fr.classify.distilbert_sequence.allocine distilbert_multilingual_sequence_classifier_allocine Text Classification
ur ur.classify.distilbert_sequence.imdb distilbert_base_sequence_classifier_imdb Text Classification
en en.embed_sentence.doc2vec doc2vec_gigaword_300 Embeddings
en en.embed_sentence.doc2vec.gigaword_300 doc2vec_gigaword_300 Embeddings
en en.embed_sentence.doc2vec.gigaword_wiki_300 doc2vec_gigaword_wiki_300 Embeddings

New Healthcare models and Features

Integrates the incredible Spark NLP for Healthcare releases 3.3.4, 3.3.2 and 3.3.1, featuring:

Complete List of Healthcare Models :

Language NLU Reference Spark NLP Reference Task
en en.med_ner.deid_subentity_augmented_i2b2 ner_deid_subentity_augmented_i2b2 Named Entity Recognition
en en.med_ner.biomarker ner_biomarker Named Entity Recognition
en en.med_ner.nihss ner_nihss Named Entity Recognition
en en.extract_relation.nihss redl_nihss_biobert Relation Extraction
en en.resolve.mesh sbiobertresolve_mesh Entity Resolution
en en.resolve.mli sbiobert_base_cased_mli Embeddings
en en.resolve.ndc sbiobertresolve_ndc Entity Resolution
en en.resolve.loinc.augmented sbiobertresolve_loinc_augmented Entity Resolution
en en.resolve.clinical_snomed_procedures_measurements sbiobertresolve_clinical_snomed_procedures_measurements Entity Resolution
es es.embed.roberta_base_biomedical roberta_base_biomedical Embeddings
es es.med_ner.roberta_ner_diag_proc roberta_ner_diag_proc Named Entity Recognition
es es.resolve.snomed robertaresolve_snomed Entity Resolution
en en.med_ner.covid_trials ner_covid_trials Named Entity Recognition
en en.classify.token_bert.bionlp bert_token_classifier_ner_bionlp Named Entity Recognition
en en.classify.token_bert.cellular bert_token_classifier_ner_cellular Named Entity Recognition
en en.classify.token_bert.chemicals bert_token_classifier_ner_chemicals Named Entity Recognition
en en.resolve.rxnorm_augmented sbiobertresolve_rxnorm_augmented Entity Resolution
en en.resolve.rxnorm_augmented sbiobertresolve_rxnorm_augmented Entity Resolution
en en.resolve.rxnorm_augmented sbiobertresolve_rxnorm_augmented Entity Resolution
en en.resolve.umls_disease_syndrome sbiobertresolve_umls_disease_syndrome Entity Resolution
en en.resolve.umls_clinical_drugs sbiobertresolve_umls_clinical_drugs Entity Resolution
en en.classify.bert_sequence.question_statement_clinical bert_sequence_classifier_question_statement_clinical Text Classification