
2024), incorporate semantic segmentation to priori-
tize meaningful regions but are computationally in-
tensive due to their iterative processing.
To achieve a high-quality low-sized SVGs, we
propose LIVBOC (Layer-wise Image Vectorization
with Bayesian-Optimized Contours), an image vec-
torization method of two main stages, a Bayesian-
optimized contour-based path initialization stage and
a path optimization stage. Contour-based techniques
have been shown to be effective in capturing shape
boundaries while minimizing redundancy (Arbel
´
aez
et al., 2011; Polewski et al., 2024). Furthermore,
Bayesian optimization (Frazier, 2018; Snoek et al.,
2012) have found widespread use in machine learning
for hyperparameter optimization. Building on those
two well-established techniques, the first stage of our
method utilizes the Bayesian optimization algorithm
to determine the optimal values for the parameters
used in identifying the optimal contour used in path
initialization. This stage gives the method the ability
to adapt to the complexity and variations of colors in
the input image. In the second stage, after path initial-
ization, we optimize the initialized path with a cus-
tom loss function composed of three loss functions:
the reconstruction loss to preserve information and
guarantee a correct direction of the path during the
optimization process, the Laplacian smoothness loss
based on (Sorkine et al., 2004) to ensure smoothness
of paths, and the overlap loss to prevent shapes from
overlapping with other correct shapes. With these two
stages, LIVBOC ensures not only achieving superior
reconstruction fidelity but also reducing the number
of required paths and computational time. Unlike pre-
vious methods, which often require a fixed number of
iterations for convergence, LIVBOC achieves high-
quality results with few iterations by integrating ro-
bust initialization with efficient optimization. Thus,
our method fits for applications demanding scalable
and high-fidelity vector graphics, such as web graph-
ics, logo design, and digital content creation.
In our experiments, we compare our LIVBOC
method with LIVE (Ma et al., 2022) and demonstrate
LIVBOC strength. Through a multi-faceted compari-
son, we highlight the efficiency of our method in gen-
erating high-quality vector image with optimal primi-
tives (optimal SVG file size).
Key contributions of our paper are as follows:
• We propose LIVBOC, a novel image vectoriza-
tion method that generates minimized SVG files
by using an adaptive number of points for differ-
ent paths.
• The resulting vector primitives are optimal or
near-optimal, allowing the user to better manip-
ulate the vectors.
• The proposed LIVBOC method is 3-5 times faster
than current state-of-the-art machine learning-
based vectorization.
2 RELATED WORK
2.1 Layered Image Vectorization via
Semantic Simplification (LIVSS)
The LIVSS method (Wang et al., 2024) is a novel ap-
proach to image vectorization that uses semantic in-
formation to guide the vectorization process. In this
method, Score Distillation Sampling (SDS) and se-
mantic segmentation are calculated to iteratively sim-
plify an input image while preserving essential fea-
tures. By doing that, LIVSS creates a series of sim-
plified layers, each of these layers contains varying
levels of detail. Then these layers are optimized to
create a high-quality vector representations with ad-
justable levels of fidelity.
The main strength of LIVSS is the ability to bal-
ance detail preservation and vectorization compact-
ness. Using semantic segmentation, the method pri-
oritizes meaningful regions first, with the possibility
of ignoring redundant details in the background. This
technique makes LIVSS effective for applications of
vectorizing images containing semantic meaningful
shapes.
Despite that, the performance of LIVSS is highly
dependent on the quality of its semantic segmentation.
Therefore, if segmentation fails to accurately delin-
eate complex objects or overlapping structures, the
resulting vectorized output may lose critical details
or introduce artifacts. Another limitation that might
appear in this method is that it might be unable to
segment unnecessary shapes, which can lead to low
quality images. It would be valuable to compare our
method with LIVSS, but at the time of writing this
paper, the LIVSS code is not publicly available.
2.2 SuperSVG
SuperSVG (Hu et al., 2024) represents a significant
advance in the field of image vectorization in 2024.
This method mainly utilize a superpixel-based frame-
work to decompose raster images into regions with
uniform colors and textures, thereby enabling effi-
cient vectorization. SuperSVG is the two-stage self-
training framework. In the first stage, the model re-
constructs the primary structure of the image using
superpixels, focusing on large, uniform areas in the
image. In the second stage, it improves the output by
VISAPP 2025 - 20th International Conference on Computer Vision Theory and Applications
832