Update README.md
README.md
CHANGED
@@ -2,16 +2,24 @@
|
|
2 |
library_name: transformers
|
3 |
language:
|
4 |
- fr
|
5 |
---
|
6 |
|
7 |
# Model Card for Model ID
|
8 |
|
9 |
-
|
10 |
|
11 |
|
12 |
|
13 |
## Model Details
|
14 |
|
15 |
### Model Description
|
16 |
|
17 |
<!-- Provide a longer summary of what this model is. -->
|
@@ -58,10 +66,20 @@ This is the model card of a 🤗 transformers model that has been pushed on the
|
|
58 |
|
59 |
## Bias, Risks, and Limitations
|
60 |
|
61 |
-
|
62 |
|
63 |
-
|
64 |
|
65 |
### Recommendations
|
66 |
|
67 |
<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
|
|
|
2 |
library_name: transformers
|
3 |
language:
|
4 |
- fr
|
5 |
+
datasets:
|
6 |
+
- OpenLLM-France/Claire-Dialogue-French-0.1
|
7 |
---
|
8 |
|
9 |
# Model Card for Model ID
|
10 |
|
11 |
+
A model pretrained on French text with a masked language modeling (MLM) objective. It was introduced in this paper and first released in this repository.
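
To make the MLM objective concrete, here is a toy sketch of the token-masking step. It assumes the BERT-style recipe (select about 15% of positions, then replace 80% of those with a mask token, 10% with a random token, and leave 10% unchanged); the exact procedure used for this model is not stated in this card. The function name `mask_tokens`, the `<mask>` string, and the whitespace tokenization are illustrative choices, not part of any library.

```python
import random

MASK = "<mask>"  # assumed mask symbol, for illustration only


def mask_tokens(tokens, vocab, mask_prob=0.15, seed=0):
    """Return (corrupted_tokens, labels) for one toy MLM training example.

    labels[i] holds the original token at positions the model must predict,
    and None at positions that would be ignored by the loss.
    """
    rng = random.Random(seed)
    corrupted, labels = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            labels.append(tok)  # the model has to recover this token
            roll = rng.random()
            if roll < 0.8:
                corrupted.append(MASK)               # 80%: mask symbol
            elif roll < 0.9:
                corrupted.append(rng.choice(vocab))  # 10%: random token
            else:
                corrupted.append(tok)                # 10%: left unchanged
        else:
            labels.append(None)  # not selected, so not scored
            corrupted.append(tok)
    return corrupted, labels


# Whitespace split stands in for a real subword tokenizer.
tokens = "le camembert est un fromage délicieux".split()
corrupted, labels = mask_tokens(tokens, vocab=tokens, mask_prob=0.5, seed=1)
print(corrupted)
print(labels)
```

During pretraining, the model sees the corrupted sequence and is trained to predict the original token at every position whose label is not `None`.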
|
12 |
|
13 |
|
14 |
|
15 |
## Model Details
|
16 |
|
17 |
+
CamemBERT is a state-of-the-art language model for French based on the RoBERTa model.
|
18 |
+
|
19 |
+
It is available on Hugging Face in six versions that differ in number of parameters, amount of pretraining data, and pretraining data source domains.
|
20 |
+
|
21 |
+
For further information or requests, please visit the CamemBERT website.
|
22 |
+
|
23 |
### Model Description
|
24 |
|
25 |
<!-- Provide a longer summary of what this model is. -->
|
|
|
66 |
|
67 |
## Bias, Risks, and Limitations
|
68 |
|
69 |
+
Although the training data used for this model could be characterized as fairly neutral, the model can still produce biased predictions.
|
70 |
|
71 |
+
### Authors
|
72 |
+
|
73 |
+
CamemBERT was trained and evaluated by Louis Martin*, Benjamin Muller*, Pedro Javier Ortiz Suárez*, Yoann Dupont, Laurent Romary, Éric Villemonte de la Clergerie, Djamé Seddah and Benoît Sagot.
|
74 |
+
|
75 |
+
If you use our work, please cite:
|
76 |
|
77 |
+
@inproceedings{martin2020camembert,
|
78 |
+
title={CamemBERT: a Tasty French Language Model},
|
79 |
+
author={Martin, Louis and Muller, Benjamin and Su{\'a}rez, Pedro Javier Ortiz and Dupont, Yoann and Romary, Laurent and de la Clergerie, {\'E}ric Villemonte and Seddah, Djam{\'e} and Sagot, Beno{\^\i}t},
|
80 |
+
booktitle={Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics},
|
81 |
+
year={2020}
|
82 |
+
}
|
83 |
### Recommendations
|
84 |
|
85 |
<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
|