We have discontinued our cloud-based data annotation platform since Oct 31st. Contact us for private deployment options.
BasicAI Cloud
Open-Source Licenses
1. Usage of Datasets in BasicAI Cloud
Please be advised that the open-source datasets provided through BasicAI Cloud are intended for internal research and testing purposes only. BasicAI does not create, curate, or screen these datasets, nor do we make any representations or warranties regarding the accuracy, legality, integrity, or appropriateness of their content. BasicAI shall not be held liable for any results obtained from the datasets or for any actions undertaken in reliance upon them. Users who utilize the datasets agree to hold harmless and indemnify BasicAI against any threats, claims, or legal proceedings that may arise from such use. Should you encounter any issues related to copyright infringement, please reach out to us at the following email: product@basic.ai .
2. Attribution for Open Source Datasets in BasicAI Cloud
The following attributions pertain to the open-source datasets available on BasicAI Cloud. Users are required to comply strictly with the licensing agreements governing the proper use of these datasets. While we have endeavored to ensure that the licensing information we provide is comprehensive and accurate, BasicAI does not warrant that there will be no errors in the information furnished. Users are therefore encouraged to independently confirm the veracity of the licensing details provided.
COCO 2017
Licensed under CC BY-SA 4.0.
Copyright Tsung-Yi Lin, Michael Maire, Serge Belongie, Lubomir Bourdev, Ross Girshick, James Hays, Pietro Perona, Deva Ramanan, C. Lawrence Zitnick, Piotr Dollár (https://arxiv.org/abs/1405.0312)
Yolo OpenCV Images
Licensed under CC0: Public Domain
Semantic Segmentation of Aerial Imagery
Licensed under CC0: Public Domain
NIH Chest Xray
Licensed under CC0: Public Domain
Pandaset
Licensed under CC BY 4.0
Copyright Scale AI, Inc. and Hesai Photonics Technology Co., Ltd
CODD (https://github.com/eduardohenriquearnold/CODD)
Licensed under CC BY-SA 4.0
Copyright Eduardo Arnold, Sajjad Mozaffari and Mehrdad Dianati
Human Action Recognition
Licensed under Open Data Commons Open Database License (ODbL)
Copyright DPhi, sponsored by Ajai Karthick, Mathias Rackson and Ankush (https://aiplanet.com/challenges/233/data-sprint-76-human-activity-recognition-233)
Visual Question Answering (https://visualqa.org)
Licensed under CC BY-SA 4.0
Copyright Antol, Stanislaw and Agrawal, Aishwarya and Lu, Jiasen and Mitchell, Margaret and Batra, Dhruv and Zitnick, C Lawrence and Parikh, Devi (https://arxiv.org/pdf/1505.00468v7.pdf)
CREMA-D (https://github.com/CheyneyComputerScience/CREMA-D)
Licensed under Open Data Commons Attribution License (ODC-By)
Copyright David Cooper Cheyney
Santa Barbara Corpus of Spoken American English
Licensed under CC BY-ND 3.0 US
The Santa Barbara Corpus was compiled by researchers in the Linguistics Department of the University of California, Santa Barbara. The Director of the Santa Barbara Corpus is John W. Du Bois, working with Associate Editors Wallace L. Chafe and Sandra A. Thompson (all of UC Santa Barbara), and Charles Meyer (UMass, Boston). For the publication of Parts 3 and 4, the authors are John W. Du Bois and Robert Englebretson.
Kinetics 400
Licensed under CC BY 4.0
Copyright DeepMind Technologies Co., Ltd
D²-City
Licensed under CC BY 4.0
Copyright Zhengping Che, Bo Jiang, Yiping Meng, Guangyu Li, Tracy Li, Ke Dong, Xinsheng Zhang, Xuefeng Shi, Ying Lyu, Guobin Wu, Yan Liu, Jian Tang, and Jieping Ye (https://arxiv.org/pdf/1904.01975.pdf)
Spam Text Message Classification
Licensed under CC0: Public Domain
Movie Archives
Licensed under CC0: Public Domain
Copyright Library of Congress
RLHF Data for LLM Science Exam
Copyright Ertuğrul Demir
MedQuAD: the Medical Question Answering Dataset
Licensed under CC BY-SA 4.0
Copyright "A Question-Entailment Approach to Question Answering". Asma Ben Abacha and Dina Demner-Fushman. BMC Bioinformatics, 2019.