Skip to content

Commit 1f91db2

Browse files
committed
added BIRDCLEF+ 2025 challenge info
1 parent 95dfc56 commit 1f91db2

1 file changed

Lines changed: 39 additions & 0 deletions

File tree

data-science-challenges.md

Lines changed: 39 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -9,6 +9,45 @@ problems using data analysis and machine learning techniques. Below are challeng
99
together with a team:
1010

1111

12+
---
13+
<br>
14+
15+
## **BIRDCLEF+ 2025**
16+
> **by Cornell Lab of Ornithology, USA**
17+
> *May 2025*
18+
19+
![Python](https://img.shields.io/badge/Python-yellow?style=flat&logo=python&logoColor=white)
20+
![Scikit-Learn](https://img.shields.io/badge/scikit--learn-%23F7931E.svg?style=flat&logo=scikitlearn&logoColor=white)
21+
![NumPy](https://img.shields.io/badge/NumPy-%23013243.svg?style=flat&logo=numpy&logoColor=white)
22+
![PyTorch](https://img.shields.io/badge/PyTorch-%23EE4C2C?style=flat&logo=pytorch&logoColor=white)
23+
24+
[Learn more about the challenge](https://www.imageclef.org/BirdCLEF2025)
25+
26+
**Overview**
27+
The task was to develop a sound classification model that determines which of
28+
206 different
29+
animal species are present in a soundclip, taken in a wild life reservoir.
30+
Such models are used to quantifiy the success of environmental restoration
31+
programs. The idea is that a high biodiversity and the presence of certain
32+
key species indicates a healthy environment and therefore the success of the
33+
restoration program.
34+
35+
**Approach**
36+
I used the PANN audio model, which was pretrained on 5000 hours of sound and
37+
predicts 527 sound classes. With the help of this model, I created
38+
embeddings of the sound training data from the BIRDCLEF challenge. The idea
39+
is that this pretrained model is already capable of finding useful features
40+
in sound data, and can therefore be used as a feature extractor. To achieve
41+
this, I removed the output layer of the PANN, and used the output of the
42+
layer before as the embeddings. Then I trained another neural network model
43+
with far fewer parameters than PANN to map from those embeddings to the
44+
probabilities of the 206 animal classes of the BIRDCLEF challenge.
45+
Unfortunately, the challenge hosts required a minimal runtime of the
46+
approach in their cloud environment, which I could not meet with my approach,
47+
which is why I was unable to submit and achieve a ranking.
48+
49+
50+
1251
---
1352
<br>
1453

0 commit comments

Comments
 (0)