About

The COVID-19 pandemic has resulted in more than 100 million infections, and more than 2 million casualties. The global crisis spans across 200 countries. Large scale testing, social distancing, and face masks have been critical measures to help contain the spread of the infection. Even with the onset of the vaccination programs, the WHO highlights large scale testing and precautionary measures must be followed for the next couple of years. While the list of symptoms is regularly updated, it is established that in symptomatic cases COVID-19 seriously impairs normal functioning of the respiratory system. Does this alter the acoustic characteristics of breathe, cough, and speech sounds produced through the respiratory system? This is an open question waiting for scientific insights. A COVID-19 diagnosis methodology based on acoustic signal analysis, if successful, can provide a remote, scalable, and economical means for testing of individuals. This can supplement the existing nucleotides based COVID-19 testing methods, such as RT-PCR and RAT.
The DiCOVA Challenge is designed to find scientific and engineering insights to the question by enabling participants to analyze an acoustic dataset gathered from COVID-19 positive and non-COVID-19 individuals. The selected findings will be presented in a special session at Interspeech 2021, the flagship conference of the global speech science and technology community, to be held in Brno from Aug 31-Sept 3, 2021. The timeliness, and the global societal importance of the challenge warrants focussed effort from researchers across the globe, including from the fields of medical and respiratory sciences, signal processing, and machine learning engineers/researchers. We look forward to your participation!

The Challenge is closed. See the Results section to know more.
DiCOVA Challenge paper is up! arXiv link
The Track-1 Challenge Leaderboard is live! Click here
Click here to download the flyer.

Results

The DiCOVA Track-1 Challenge (COVID-19 detection fromm cough sounds) received registrations from 85 teams, spread across the globe and coming from industry, academia and independent individuals. All these teams were sent the challenge datasets. Of these teams, 29 participated in evaluating their systems against the blind test set (233 audio files). For this, a leaderboard was set up in Codalab and teams posted a COVID probability score for each test audio file. In response, they received the AUC score (area under the ROC curve) computed over the 233 test audio files. A high AUC (0-100%) implies better performance. Team T-1 posted an AUC of 87.04% and finished on top of the leaderboard. On the right you can see the classification performance of this team on the blind test set. Below we illustrate a few of our observations on the activity seen on the leaderboard.

The leaderboard saw participation from 29 teams.

Each team was given a maximum of 25 attempts to evaluate their system performance against the hidden blind test labels. The AUCs of many of these systems performed better than the baseline system.

There was a good diversity in kinds of features used by the teams. These features ranged from simple hand-crafted acoustic features (like, ZCR, energy) to advanced acoustic representations (embeddings) obtained using pre-trained DNNs.

The novelty of the task made teams also experiment with diverse kinds of classifiers.

The challenge task required handling class data imbalance. For this, several teams experimented with data augmentation (adding noise, reverberation, pitch shifting, etc., or cough files from other public datasets, like COUGHVID), and system fusion.

The best performance was posted by team T-1 with an AUC of 87.04%, significantly improving over the baseline system performance (69.85%). This performance was followed by two close competitors, team T-2 posting 85.43% AUC and team T-3 posting 85.35% AUC. It was wonderful to see nine teams scores above 80% AUC!

The evaluation was open for 22 days. In the initial days only a few teams evaluated their systems. As days passed, the leaderboard activity began to gain pace, and teams started improving their AUCs.

How did the best AUC on the leaderboard change over evaluation days?

Does more evaluation by a team imply a better AUC? There is some correlation :)!

How does the performance on the test set compare against the performance on the val set?

An important metric in evaluating a diagnosis tool is its specificity at some sensitivity. For the challenge we evaluated the specificity at 80% sensitivity. Below we show how different systems fared in this. The best specificity obtained was 83.33% by team T-1.

And finally, here are the ROCs of the 29 systems corresponding to the best system of each team.

System Reports

Few teams have given the consent to share their system reports here. You can view these below and know more about the designed systems.

Team T-1: The Brogrammers DiCOVA 2021 Challenge System Report: Members: Saranga Kingkor Mahanta, Shubham Jain, Darsh Kaushik. Get PDF BIB
Team T-2: DiCOVA-Net: Diagnosing COVID-19 using Acoustics based on Deep Residual Network for the DiCOVA Challenge 2021: Members: Jiangeng Chang, Shaoze Cui, Mengling Feng. Get PDF BIB
Team T-3: UIUC SST DiCOVA 2021 Challenge System Report: Members: John Harvill, Yash Wani, Mark Hasegawa-Johnson, Narendra Ahuja, David Beiser, David Chestek. Get PDF BIB
Team T-4: The North DiCOVA 2021 Challenge System Report: Members: Isabella Sodergren, Maryam Pahlavan Nodeh, Konstantina Nikolaidou, Prakash Chandra Chhipa, Gyorgy Kovacs. Get PDF BIB
Team T-5: Samsung R and D Bangalore DiCOVA 2021 Challenge System Report: Members: Vishwanath Pratap Singh, Shashi Kumar, Ravi Shekhar Jha. Get PDF BIB
Team T-6: HLT-NUS DiCOVA 2021 Challenge System Report: Members: Rohan Kumar Das, Maulik Madhavi and Haizhou Li. Get PDF BIB
Team T-7: COVID-19 Detection Using Recorded Coughs in the 2021 DiCOVA Challenge: Members: Benjamin Elizalde, Daniel Tompkins. Get PDF
Team T-9: SpeechAUC System for Diagnosing COVID-19 Using Acoustics Challenge 2021: Members: Flavio Avila, Amir H. Poorjam, Deepak Mittal, Charles Dognin, Ananya Muguli, Rohit Kumar, Srikanth Raj Chetupalli, Sriram Ganapathy, Maneesh Singh. Get PDF BIB
Team T-15: PANACEA cough sound-based diagnosis of COVID-19 for the DiCOVA 2021 Challenge: Members: Madhu R. Kamble, Jose A. Gonzalez-Lopez, Teresa Grau, Juan M. Espin, Lorenzo Cascioli, Yiqing Huang, Alejandro Gomez-Alanis, Jose Patino, Roberto Font, Antonio M. Peinado, Angel M. Gomez, Nicholas Evans, Maria A. Zuluaga, Massimiliano Todisco. Get PDF
Team T-16: A Residual Network based Deep Learning Model for Detection of COVID-19 from Cough Sounds: Members: Annesya Banerjee, Achal Nilhani. Get PDF
Team T-18: TCS R& I-SNLP DiCOVA 2021 Challenge System Report: Members: Swapnil Bhosale, Upasana Tiwari, Rupayan Chakraborty, Sunil Kumar Kopparapu. Get PDF BIB
Team T-21: EIHW-MTG DiCOVA 2021 Challenge System Report: Members: Adria Mallol-Ragolta, Helena Cuesta, Emilia Gomez, and Bjorn W. Schuller. Get PDF BIB
Team T-24 (Baseline): DiCOVA Challenge: Dataset, task, and baseline system for COVID-19 diagnosis using acoustics: Members: Ananya Muguli, Lancelot Pinto, Nirmala R., Neeraj Sharma, Prashant Krishnan, Prasanta Kumar Ghosh, Rohit Kumar, Shrirama Bhat, Srikanth Raj Chetupalli, Sriram Ganapathy, Shreyas Ramoji, Viral Nanda. Get PDF

Tracks

This special session features two tracks and you can participate in one or both of them. The Track-1 is focussed only on cough sound recordings, and Track-2 is open for use of broader sound categories, like, cough, breath, sustained phonation, and continuous speech.
You are encouraged to submit your findings to the DiCOVA Special Session at Interspeech 2021 for peer-review and subsequent consideration for presentation (and publication) in the conference. For this we require you to participate in one or both the tracks.

Track-1: Cough Sound
Click to expand.
Track-2: Multi Sound
Click to expand.

Frequently Asked Questions

Q. Which programming languages can I use?: A. You are free to use any programming language you like. For system evaluation we will require you to submit the output decisions as a CSV/TXT file.
Q. How big is the Track-1 challenge dataset?: A. The train-val dataset for Track-1 contains a total of ~1.36 hrs of cough audio recordings from 75 COVID-19+ve subjects and 965 non-COVID-19 subjects. The compressed zip file size is 160 MB only. The audio data is compressed as .FLAC, sampling rate 44.1 kHz.
Q. How do I get the DiCOVA audio dataset?: A. It is simple - by registering for the challenge. Please see the registration section in this webpage (above).
Q. Can I re-distribute the data?: A. Yes but only after obtaining consent of the organizers.
Q. Are there other datasets I can use?: A. For both Track-1 and Track-2 you are not allowed to use Project Coswara data. You can use any other data with proper citation of the source in the report and the Interspeech manuscript.
Q. How do I submit my findings obtained by participating in this challenge to Interspeech 2021?: A. That's great! You can follow the Interspeech 2021 paper submission portal here. Remember to select "Special Session DiCOVA" while uploading your paper there.
Q. Can I obtain/use the DiCOVA audio data without participating in the challenge?: A. No. We might re-consider this answer after the end of the challenge. Please contact us then.

About

The Challenge is closed. See the Results section to know more.

DiCOVA Challenge paper is up! arXiv link

The Track-1 Challenge Leaderboard is live! Click here

Click here to download the flyer.

Results

System Reports

Timeline (Tentative) [23:59hrs AOE]

Tracks

Register

Organizers

Frequently Asked Questions

Contact Us