
0
2
4
6
0 100 200 300
Duration of the DVS−recording [s]
Runtime [s]
group
Writing
Reading
Pre−processing
Remapping
Clustering
LOWESS
Post−processing
Figure 11: The runtime plotted against the length of the
DVS-recording and separated into the individual steps of
the processing pipeline.
age runtime varies with the DVS-recording duration
and how it is distributed among the different process-
ing steps. The runtime was measured on an AMD
Ryzen 7 5700U CPU running Ubuntu 20.04 and aver-
aged over 100 runs. As can be seen from Figure 11,
the processing runtime is always considerably shorter
than the duration of the recording.
We also measured the speedup S due to pre-
processing, defined as the runtime without subsam-
pling divided by the runtime with subsampling. The
speedup for noise filtering was greater in unnoisy data
(S ≈ 20) than in noisy data (S ≈ 4). This is because
subsampling reduces the number of noisy points less
effectively, as many noise points occupy their own
cubes. The speedup for noisy data increases with
noise removal during subsampling. This in partic-
ular affected the MST-based clustering which had a
speedup of 30. This means that subsampling is essen-
tial for an application in real time.
4 CONCLUSIONS
We have developed a method for fast instance seg-
mentation of insect flight tracks in DVS data, treating
time as another dimension to preserve high temporal
resolution. The central part of the algorithm is a den-
sity based clustering, for which either DBSCAN or
MST-based clustering can be chosen. The MST-based
clustering was considerably more robust with respect
to noise and is thus preferable. Both algorithms, how-
ever, failed to separate close by tracks, and we have
implemented a post-processing step that remedies this
shortcoming in most situations.
Due to subsampling during pre-processing, the
method has a runtime much shorter than the DVS
recording duration and is thus applicable in real time.
Noise removal was optionally included in the subsam-
pling step and in the clustering step. These automatic
noise detection makes the method quite robust in the
presence of noise, which is important for its deploy-
ment in natural scenarios.
Although the method has an accuracy comparable
to a manual segmentation by a human, it occasionally
removes thin tracks. This makes the method currently
less effective for small insects like mosquitos. For
visualisation or further analysis, we also fit flight tra-
jectories through the returned clusters. Although our
usage of LOWESS was satisfactory, a local regres-
sion based on the number of neighbours can become
problematic for scenarios with a wide range of track
thicknesses and lengths. It would be interesting to in-
vestigate different local regression methods, e.g. by
basing the local region on fixed time intervals.
Although we tested our method only with DVS
data with two spatial dimensions, there is nothing spe-
cial in our algorithm that restricts it to 2D data. The
method can thus readily be deployed to 3D data like
that recorded by the new system developed in the Bee-
Vision project (Pohle-Fr
¨
ohlich et al., 2024), which
aims at counting insect populations over a long pe-
riod. We plan to deploy our algorithm in this project
and use it as a second step after semantic segmenta-
tion. This will then be followed by a classification
step which leads to an automatic counting of species
occurances in the data.
REFERENCES
Cleveland, W. S. (1979). Robust locally weighted regres-
sion and smoothing scatterplots. Journal of the Amer-
ican Statistical Association, 74(368):829–836.
Dalitz, C., Ayyad, Y., Wilberg, J., Aymans, L., Bazin, D.,
and Mittig, W. (2019a). Automatic trajectory recog-
nition in Active Target Time Projection Chambers
data by means of hierarchical clustering. Computer
Physics Communications, 235:159–168.
Dalitz, C., Wilberg, J., and Aymans, L. (2019b). TriplClust:
An algorithm for curve detection in 3D point clouds.
Image Processing On Line, 8:26–46.
Ester, M., Kriegel, H.-P., Sander, J., Xu, X., et al. (1996).
A density-based algorithm for discovering clusters in
large spatial databases with noise. In kdd, volume 96,
pages 226–231.
Hallmann, C. A., Sorg, M., Jongejans, E., Siepel, H.,
Hofland, N., Schwan, H., Stenmans, W., M
¨
uller,
A., Sumser, H., H
¨
orren, T., et al. (2017). More
VISAPP 2025 - 20th International Conference on Computer Vision Theory and Applications
76