jqueguiner / camembert-as-a-service

MIT License
8 stars 3 forks source link

CamemBERT API will complete your masked french text such as

According to CamemBERT's team on camembert-model.fr

CamemBERT is a state-of-the-art language model for French based on the RoBERTa architecture pretrained on the French subcorpus of the newly available multilingual corpus OSCAR. We evaluate CamemBERT in four different downstream tasks for French: part-of-speech (POS) tagging, dependency parsing, named entity recognition (NER) and natural language inference (NLI); improving the state of the art for most tasks over previous monolingual and multilingual approaches, which confirms the effectiveness of large pretrained language models for French. CamemBERT was trained and evaluated by Louis Martin, Benjamin Muller, Pedro Javier Ortiz Suárez, Yoann Dupont, Laurent Romary, Éric Villemonte de la Clergerie, Djamé Seddah and Benoît Sagot.

@ARTICLE{2019arXiv191103894M,
       author = {{Martin}, Louis and {Muller}, Benjamin and
         {Ortiz Su{\'a}rez}, Pedro Javier and {Dupont}, Yoann and
         {Romary}, Laurent and {Villemonte de la Clergerie}, {\'E}ric and
         {Seddah}, Djam{\'e} and {Sagot}, Beno{\^\i}t},
        title = "{CamemBERT: a Tasty French Language Model}",
      journal = {arXiv e-prints},
     keywords = {Computer Science - Computation and Language},
         year = "2019",
        month = "Nov",
          eid = {arXiv:1911.03894},
        pages = {arXiv:1911.03894},
archivePrefix = {arXiv},
       eprint = {1911.03894},
 primaryClass = {cs.CL},
       adsurl = {https://ui.adsabs.harvard.edu/abs/2019arXiv191103894M},
      adsnote = {Provided by the SAO/NASA Astrophysics Data System}
}

EXAMPLE output


INPUT

{
  "text": "Le camembert est <mask> :)",
  "top_k": 5
}

EXECUTION

curl -X POST "https://api-market-place.ai.ovh.net/text-camembert/process" -H "accept: application/json" -H "X-OVH-Api-Key: XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX" -H "Content-Type: application/json" -d '{ "text": "Le camembert est <mask> :)",  "top_k": 5}'

OUTPUT

[
  [
    "Le camembert est délicieux :)",
    0.49091199040412903,
    " délicieux"
  ],
  [
    "Le camembert est excellent :)",
    0.10556956380605698,
    " excellent"
  ],
  [
    "Le camembert est succulent :)",
    0.03453320264816284,
    " succulent"
  ],
  [
    "Le camembert est meilleur :)",
    0.033031098544597626,
    " meilleur"
  ],
  [
    "Le camembert est parfait :)",
    0.03007647767663002,
    " parfait"
  ],
  [
    "Le camembert est bon :)",
    0.02145528607070446,
    " bon"
  ],
  [
    "Le camembert est délicieuse :)",
    0.015332158654928207,
    " délicieuse"
  ],
  [
    "Le camembert est magnifique :)",
    0.012028267607092857,
    " magnifique"
  ],
  [
    "Le camembert est savoureux :)",
    0.009316671639680862,
    " savoureux"
  ],
  [
    "Le camembert est divin :)",
    0.008841886185109615,
    " divin"
  ]
]

please refer to swagger documentation for further technical details: swagger documentation


Mascot by Alix Chagué © 2019