camila-ud commited on
Commit
3fd893a
1 Parent(s): db7ce2d

Update Dataset source in README

Browse files
Files changed (1) hide show
  1. README.md +12 -3
README.md CHANGED
@@ -12,10 +12,10 @@ tags:
12
  - medkit-lib
13
  widget:
14
  - text: >-
15
- Elle souffre d'asthme mais n'a pas besoin d'Allegra
16
  example_title: example 1
17
  - text: >-
18
- La radiographie et la tomodensitométrie ont montré des micronodules diffus
19
  example_title: example 2
20
  ---
21
 
@@ -30,8 +30,17 @@ It has been trained to detect the following type of entities: **problem**, **tr
30
 
31
  - **Fine-tuned using** medkit [GitHub Repo](https://github.com/TeamHeka/medkit)
32
  - **Developed by** @camila-ud, medkit, HeKA Research team
33
- - **Dataset from** @aneuraz, CASM2
 
 
 
34
 
 
 
 
 
 
 
35
  # Intended uses & limitations
36
 
37
  ## Limitations and bias
 
12
  - medkit-lib
13
  widget:
14
  - text: >-
15
+ La radiographie et la tomodensitométrie ont montré des micronodules diffus
16
  example_title: example 1
17
  - text: >-
18
+ Elle souffre d'asthme mais n'a pas besoin d'Allegra
19
  example_title: example 2
20
  ---
21
 
 
30
 
31
  - **Fine-tuned using** medkit [GitHub Repo](https://github.com/TeamHeka/medkit)
32
  - **Developed by** @camila-ud, medkit, HeKA Research team
33
+ - **Dataset source**
34
+
35
+ Annotated version from @aneuraz called 'corpusCasM2: A corpus of annotated clinical texts'
36
+ - The annotation was performed collaborativelly by the students of masters students from Université Paris Cité.
37
 
38
+ - The corpus contains documents from CAS:
39
+ ```
40
+ Natalia Grabar, Vincent Claveau, and Clément Dalloux. 2018. CAS: French Corpus with Clinical Cases.
41
+ In Proceedings of the Ninth International Workshop on Health Text Mining and Information Analysis,
42
+ pages 122–128, Brussels, Belgium. Association for Computational Linguistics.
43
+ ```
44
  # Intended uses & limitations
45
 
46
  ## Limitations and bias