BasicAI Cloud
Open-Source Licenses

1. Usage of Datasets in BasicAI Cloud

 

Please be advised that the open-source datasets provided through BasicAI Cloud are intended for internal research and testing purposes only. BasicAI does not create, curate, or screen these datasets, nor do we make any representations or warranties regarding the accuracy, legality, integrity, or appropriateness of their content. BasicAI shall not be held liable for any results obtained from the datasets or for any actions undertaken in reliance upon them. Users who utilize the datasets agree to hold harmless and indemnify BasicAI against any threats, claims, or legal proceedings that may arise from such use. Should you encounter any issues related to copyright infringement, please reach out to us at the following email: product@basic.ai.

2. Attribution for Open Source Datasets in BasicAI Cloud

 

The following attributions pertain to the open-source datasets available on BasicAI Cloud. Users are required to comply strictly with the licensing agreements governing the proper use of these datasets. While we have endeavored to ensure that the licensing information we provide is comprehensive and accurate, BasicAI does not warrant that there will be no errors in the information furnished. Users are therefore encouraged to independently confirm the veracity of the licensing details provided.

 

COCO 2017

 

Licensed under CC BY-SA 4.0.
Copyright Tsung-Yi Lin, Michael Maire, Serge Belongie, Lubomir Bourdev, Ross Girshick, James Hays, Pietro Perona, Deva Ramanan, C. Lawrence Zitnick, Piotr Dollár (https://arxiv.org/abs/1405.0312)

 

Yolo OpenCV Images

 

Licensed under CC0: Public Domain

 

Semantic Segmentation of Aerial Imagery

 

Licensed under CC0: Public Domain

 

NIH Chest Xray

 

Licensed under CC0: Public Domain

 

Pandaset

 

Licensed under CC BY 4.0 
Copyright Scale AI, Inc. and Hesai Photonics Technology Co., Ltd

 

CODD (https://github.com/eduardohenriquearnold/CODD)

 

Licensed under CC BY-SA 4.0
Copyright Eduardo Arnold, Sajjad Mozaffari and Mehrdad Dianati

 

Human Action Recognition

 

Licensed under Open Data Commons Open Database License (ODbL)
Copyright DPhi, sponsored by Ajai Karthick, Mathias Rackson and Ankush (https://aiplanet.com/challenges/233/data-sprint-76-human-activity-recognition-233)

 

Visual Question Answering (https://visualqa.org)

 

Licensed under CC BY-SA 4.0
Copyright Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh (https://arxiv.org/pdf/1505.00468v7.pdf)

 

CREMA-D (https://github.com/CheyneyComputerScience/CREMA-D)

 

Licensed under Open Data Commons Attribution License (ODC-By)
Copyright David Cooper Cheyney

 

Santa Barbara Corpus of Spoken American English 

 

Licensed under CC BY-ND 3.0 US
The Santa Barbara Corpus was compiled by researchers in the Linguistics Department of the University of California, Santa Barbara. The Director of the Santa Barbara Corpus is John W. Du Bois, working with Associate Editors Wallace L. Chafe and Sandra A. Thompson (all of UC Santa Barbara), and Charles Meyer (UMass, Boston). For the publication of Parts 3 and 4, the authors are John W. Du Bois and Robert Englebretson.

 

Kinetics 400

 

Licensed under CC BY 4.0
Copyright DeepMind Technologies Co., Ltd

 

D²-City

 

Licensed under CC BY 4.0
Copyright Zhengping Che, Bo Jiang, Yiping Meng, Guangyu Li, Tracy Li, Ke Dong, Xinsheng Zhang, Xuefeng Shi, Ying Lyu, Guobin Wu, Yan Liu, Jian Tang, and Jieping Ye (https://arxiv.org/pdf/1904.01975.pdf)

 

Spam Text Message Classification

 

Licensed under CC0: Public Domain

 

Movie Archives

 

Licensed under CC0: Public Domain
Copyright Library of Congress

RLHF Data for LLM Science Exam

 

Copyright Ertuğrul Demir
 

MedQuAD: the Medical Question Answering Dataset

 

Licensed under CC BY-SA 4.0

Copyright "A Question-Entailment Approach to Question Answering". Asma Ben Abacha and Dina Demner-Fushman. BMC Bioinformatics, 2019.
