issues
search
bigscience-workshop
/
data_tooling
Tools for managing datasets for governance and training.
Apache License 2.0
77
stars
48
forks
source link
Create dataset global_voices_arabic
#208
Open
albertvillanova
opened
2 years ago
albertvillanova
commented
2 years ago
uid: global_voices_arabic
type: primary
description:
name: Global Voices Arabic
description: Global Voices pages in Arabic
homepage:
https://ar.globalvoices.org/
validated: True
languages:
language_names:
Arabic
ar-MSA
language_comments:
language_locations:
Middle East and North Africa
validated: False
custodian:
name:
in_catalogue: global_voices
type:
location:
contact_name:
contact_email:
contact_submitter: False
additional:
validated: False
availability:
procurement:
for_download: Yes - it has a direct download link or links
download_url:
https://ar.globalvoices.org/
download_email:
licensing:
has_licenses: Yes
license_text:
https://globalvoices.org/about/global-voices-attribution-policy/
license_properties:
open license
license_list:
cc-by-3.0: Creative Commons Attribution 3.0 Unported
pii:
has_pii: Yes
generic_pii_likely: very likely
generic_pii_list:
names
email addresses
numeric_pii_likely: somewhat likely
numeric_pii_list:
telephone numbers
sensitive_pii_likely: very likely
sensitive_pii_list:
racial or ethnic origin
political opinions
religious or philosophical beliefs
no_pii_justification_class:
no_pii_justification_text:
validated: False
source_category:
category_type: website
category_web: news or magazine website
category_media:
validated: False
media:
category:
text
text_format:
.HTML
audiovisual_format:
image_format:
database_format:
text_is_transcribed: No
instance_type: article
instance_count: 1K<n<10K
instance_size: 100<n<10,000
validated: False
fname: global_voices_arabic.json
apergo-ai
commented
2 years ago
self-assign
apergo-ai
commented
2 years ago
request for permission sent