Simultaneous Estimation of Spectral Reﬂectance and Normal

from a Small Number of Images

Masahiro Kitahara

, Takahiro Okabe

, Christian Fuchs

and Hendrik P. A. Lensch

Department of Artiﬁcial Intelligence, Kyushu Institute of Technology, Iizuka, Japan

Computer Graphics Group, T¨ubingen University, T¨ubingen, Germany

Keywords:

Multispectral Imaging, Photometric Stereo, Spectral Reﬂectance, Surface Normal.

Abstract:

Spectral reﬂectance is inherent characteristics of an object surface and therefore useful not only for computer

vision tasks such as material classiﬁcation but also compute graphics applications such as relighting. In this

study, by integrating multispectral imaging and photometric stereo, we propose a method for simultaneously

estimating the spectral reﬂectance and normal per pixel from a small number of images taken under multispec-

tral and multidirectional light sources. In addition, taking attached shadows observed on curved surfaces into

consideration, we derive the minimum number of images required for the simultaneous estimation and propose

a method for selecting the optimal set of light sources. Through a number of experiments using real images,

we show that our proposed method can estimate spectral reﬂectances without the ambiguity of per-pixel scales

due to unknown normals, and that, when the optimal set of light sources is used, our method performs as well

as the straightforward method using a large number of images. Moreover, we demonstrated that estimating

both the spectral reﬂectances and normals is useful for relighting under novel illumination conditions.

1 INTRODUCTION

The appearance of an object depends not only on the

object itself but also on the light source illuminat-

ing the object and on the camera capturing its im-

age. Therefore, the same object appears differently

under different light sources and with different cam-

eras. This appearance variation often causes the per-

formance degradation of various computer vision al-

gorithms.

The fraction of incident light power at each wave-

length that is reﬂected on an object surface is called

spectral reﬂectance. Since the spectral reﬂectance is

inherent characteristics of an object surface and inde-

pendent of light sources and cameras, it is useful for

computer vision tasks such as material classiﬁcation

and scene segmentation as well as computer graph-

ics applications such as relighting. In particular, the

use of spectral reﬂectance can prevent the occurrence

of so-called metamerism, i.e. a coincidental match of

apparent RGB colors of object surfaces with different

spectral reﬂectances.

When the spectral distribution of the incident light

to an object surface is known in advance, the spec-

tral reﬂectance is computed by division, i.e. the ratio

between the reﬂected light observed on the surface to

the incident light at each wavelength. We can measure

the spectral distribution of the reﬂected light by using

point sensors such as spectrometers (Wellman, 1981)

and area sensors such as multispectral cameras (Yam-

aguchi et al., 2006) and hyperspectral cameras (Gat,

2000; Schechner and Nayar, 2002). Instead of using

those special sensors, RGB cameras can be used to-

gether with multispectral light sources (Park et al.,

2007; Han et al., 2013).

Unfortunately, however, the reﬂected light de-

pends not only on the spectral reﬂectance but also

on the normal of an object surface, and the above

techniques cannot estimate surface normals because

they assume that the direction or location of the light

source is ﬁxed. Therefore, the estimated spectral re-

ﬂectance has the ambiguity of a per-pixel unknown

scale, i.e. the inner product between the light source

direction and the surface normal at each pixel. In

other words, we cannot tell whether the reﬂectance is

large (small) or the normal faces in a similar (dissimi-

lar) direction to the light source. Such a per-pixel un-

known scale could degrade the performance of mate-

rial classiﬁcation and scene segmentation, and more-

over relighting under novel light source directions is

303

Kitahara M., Okabe T., Fuchs C. and P. A. Lensch H..

Simultaneous Estimation of Spectral Reﬂectance and Normal from a Small Number of Images.

DOI: 10.5220/0005302503030313

In Proceedings of the 10th International Conference on Computer Vision Theory and Applications (VISAPP-2015), pages 303-313

ISBN: 978-989-758-089-5

 2015 SCITEPRESS (Science and Technology Publications, Lda.)

400 500 600 700 800 900 1000

Figure 1: The images of a plaster relief (left) captured by

using the multispectral light stage (right). The pixel values

are scaled for display purpose.

impossible without using normals.

In this paper, we address the estimation of the

spectral reﬂectance and normal of an object surface

by integrating multispectral imaging and photometric

stereo. Speciﬁcally, we assume the Lambertian model

and the low-dimensional linear model of spectral re-

ﬂectance (Parkkinen et al., 1989), and estimate both

the coefﬁcients of the spectral reﬂectance and normal

per pixel from the images taken under multispectral

and multidirectional light sources as shown in Fig-

ure 1. It is obvious that the straightforward method,

i.e. photometric stereo (Woodham, 1980) followed by

multispectral imaging (Park et al., 2007; Han et al.,

2013) can estimate both the normals and spectral re-

ﬂectances of matte surfaces from a large number of

images. However, there is a room for signiﬁcantly re-

ducing the number of images.

Accordingly, we propose a method for simulta-

neously estimating spectral reﬂectances and normals

from a small number of images taken under multi-

spectral and multidirectional light sources on the ba-

sis of the alternating least square (ALS) algorithm. In

addition, taking attached shadows observed on curved

surfaces under varying light source directions into

consideration, we derive the minimum number of im-

ages required for estimating the spectral reﬂectance

and normal per pixel and propose a method for select-

ing the optimal set of light sources in terms of noise

from given light sources.

The main contribution of this study is three-

fold; (i) the simultaneous estimation of spectral re-

ﬂectances and normals from a small number of im-

ages, (ii) the derivation of the minimum number of

images required for the simultaneous estimation, and

(iii) the light source optimization for robust estima-

tion from a small number of images. Through a

number of experiments using real images, we con-

ﬁrmed that, even from a small number of images,

the proposed method can accurately estimate spec-

tral reﬂectances without the ambiguity of per-pixel

unknown scales, and demonstrated that the estimated

spectral reﬂectances and normals enable relighting

under novel light source spectral distributions as well

as under novel light source directions.

2 REFLECTION MODEL

Assuming the Lambertian model, the pixel value i at a

surface point illuminated by a directional light source

is described as

i =

l(λ)ρ(λ)c(λ)dλ s

⊤

n, (1)

where λ is the wavelength of incident and reﬂected

light, and l(λ), ρ(λ), and c(λ) are the spectral distri-

bution of the light source, the spectral reﬂectance at

the point, and the spectral sensitivity of a camera re-

spectively. The direction of the light source and the

normal at the point are denoted by s

s and n

n. Our pro-

posed method assumes that the directions and spec-

tral distributions of the light sources and the spec-

tral sensitivities of the camera are known, i.e. they

are calibrated in advance, and estimates the spec-

tral reﬂectance and normal from the pixel values ob-

served under multispectral and multidirectional light

sources.

The spectral reﬂectance is a continuous function

with respect to wavelength, and describes how the

reﬂectance changes depending on the wavelength of

incident and reﬂected light. Since the number of

unknowns is large, e.g. about 80 unknowns when

estimating spectral reﬂectances in the visible range

every 5 nm, the estimation of spectral reﬂectances

is prone to an ill-posed and/or ill-conditioned prob-

lem. Accordingly, our proposed method stably esti-

mates spectral reﬂectances by constraining the space

of spectral reﬂectances on the basis of their statisti-

cal characteristics. Speciﬁcally, our method makes

use of the low-dimensional model of spectral re-

ﬂectance (Parkkinen et al., 1989). They apply PCA

to the dataset of spectral reﬂectances, and show that

any spectral reﬂectance is approximately represented

by a linear combination of basis functions as

ρ(λ) =

∑

k=1

(λ), (2)

where K, α

, and b

(λ) are the number of basis func-

tions, the coefﬁcients of the linear combination, and

the basis functions respectively. In this study, we use

the same basis functions and set K = 8 according to

Parkkinen et al.

VISAPP2015-InternationalConferenceonComputerVisionTheoryandApplications

304

Substituting eq.(2) into eq.(1), we obtain

i =

∑

k=1

l(λ)b

(λ)c(λ)dλ s

⊤

n. (3)

Therefore, the estimation of the spectral reﬂectance

and normal at a surface point results in estimating the

normal n

n and the coefﬁcients of the linear combina-

tion α

(k = 1,2,...,8). The number of unknowns at

each pixel is 10 in total, i.e. 2 for the normal and 8 for

the spectral reﬂectance. Note that a normal is a 3D

vector with unit length.

3 STRAIGHTFORWARD

METHOD

In this section, we explain the straightforward

method, i.e. photometric stereo for estimating nor-

mals followed by multispectral imaging for estimat-

ing spectral reﬂectances.

3.1 Estimating Surface Normal

We assume that images under multispectral and

multidirectional light sources are captured by us-

ing a multispectral light stage similar to the existing

ones (Ajdin et al., 2012; Gu and Liu, 2012). Specif-

ically, the light stage has P clusters of light sources

at different directions, and each cluster has Q light

sources with different spectral distributions. We de-

note the pixel value observed under the p-th light

source direction (p = 1,2,...,P) and the q-th light

source spectral distribution (q = 1,2,...,Q) and by the

r-th channel of an RGB camera (r = 1,2,3) by

pqr

∑

k=1

(λ)b

(λ)c

(λ)dλ s

⊤

n. (4)

Taking summation with respect to the light source

spectral distribution q and the camera channel r, we

obtain the gray scale

′

∑

q=1

∑

r=1

pqr

∑

q=1

∑

r=1

∑

k=1

(λ)b

(λ)c

(λ)dλ s

⊤

= ρ

′

⊤

n, (5)

where ρ

′

is an unknown scalar independent of the in-

dex of the light source direction p.

By using the P gray images, we rewrite eq.(5) in a

matrix form as







′



















′

, (6)

′

= S(ρ

′

n). (7)

This means that we can estimate normals in a similar

manner to the classic photometric stereo (Woodham,

1980). In general, if the number of light sources is

larger than three, we can estimate the normal up to an

unknown scale by using the pseudo inverse matrix of

S as

′

n =



⊤



−1

⊤

′

= S

′

. (8)

Since a normal has unit length, the normal is given

n = ρ

′

n/|ρ

′

n|. Note that we remove the p-th light

source direction from the equations above if a surface

point is shadowed under that light source direction.

In our experiments, we detect shadows by using a

threshold on pixel values.

3.2 Estimating Spectral Reﬂectance

Once the normal is estimated, by using P × Q color

images, a set of liner equations with respect to the

coefﬁcients of the spectral reﬂectance α

is derived

from eq.(4) as







pqr













·· ·

dλs

⊤

n ···



















(9)

In theory, we can estimate the coefﬁcients of the spec-

tral reﬂectance in a similar manner to the above by

using the pseudo inverse matrix.

Unfortunately, however, it is reported that such

a naive estimation tends to be unstable, when the

number of light source spectral distributions is small

and/or the spectral distributions are not optimally cho-

sen (Park et al., 2007; Han et al., 2013). There-

fore, similar to those existing techniques, we incor-

porate the smoothness and non-negativity constraints

into the estimation;

{

,...,

} = arg min

{α

,...,α

}

(

∑

p=1

∑

q=1

∑

r=1

pqr

−

∑

k=1

(λ)b

(λ)c

(λ)dλs

⊤

∑

k=1

(λ)

dλ







subject to

∑

k=1

(λ) ≥ 0, (10)

SimultaneousEstimationofSpectralReflectanceandNormalfromaSmallNumberofImages

305

where w is an empirical parameter that balances the

likelihood term and the smoothness term that tries

to minimize the second order derivatives. We set w

in eq.(10) and eq.(11) to 300 throughout our exper-

iments. We used the MATLAB implementation of

the active-set algorithm for solving the above linear

least-square problem with linear constraints. Once the

coefﬁcients of the linear combination are estimated,

we can obtain the spectral reﬂectance by substituting

them into eq.(2).

4 PROPOSED METHOD

4.1 Overview

The straightforward method described in Section 3

uses photometric stereo and multispectral imaging

separately. Therefore, it requires the images taken un-

der multidirectional light sources with the same spec-

tral distribution

for estimating normals and the im-

ages taken under multispectral light sources for es-

timating spectral reﬂectances. Speciﬁcally, ignoring

attached shadows observed on curved surfaces, the

straightforward method requires 3 images taken under

the light sources at different non-coplanar directions

but with the same spectral distribution, and 3 color

images taken under the light sources with different

spectral distributions

since eq.(3) has 8 unknowns

with respect to the spectral reﬂectance and each im-

age yields 3 constraints (8 < 3× 3).

On the other hand, ignoring attached shadows, we

must be able to estimate the spectral reﬂectance and

normal at a surface point from at least 4 color images

in theory, since the number of unknowns is 10 in to-

tal as described after eq.(3) and each image yields 3

constraints (10 < 4 × 3). This motivates us to pro-

pose a method for simultaneously estimating spectral

reﬂectances and normals from a small number of im-

ages by integrating multispectral imaging and photo-

metric stereo.

In the rest of this section, we formulate the simul-

taneous estimation of the spectral reﬂectance and nor-

mal per pixel from a small number of images. Then,

taking attached shadows observed on curved surfaces

into consideration, we derive the minimum number

of images required for the simultaneous estimation.

Finally, we propose a method for selecting the opti-

mal light sources from those of the multispectral light

The gray scale images deﬁned by eq.(5) are used in

subsection 3.2.

Those spectral distributions should be chosen carefully

so that eq.(9) or eq.(10) can be solved.

stage on the basis of a variant of the noise propagation

analysis (Drbohlav and Chantler, 2005).

4.2 Simultaneous Estimation

We propose a method for simultaneously estimating

spectral reﬂectances and normals from a small num-

ber of images. By integrating spectral imaging and

photometric stereo, our proposed method is formu-

lated as

{

α} = arg min

n,α

α}

(

∑

(p,q)∈I

∑

r=1

pqr

−

∑

k=1

(λ)b

(λ)c

(λ)dλs

⊤

∑

k=1

(λ)

dλ







subject to

∑

k=1

(λ) ≥ 0, (11)

where α

α = (α

,α

,·· · ,α

)

⊤

is the coefﬁcient

vector of the spectral reﬂectance and I speciﬁes the

set of images from which the spectral reﬂectance and

normal are estimated.

We can see that the cost function in eq.(11) has a

bilinear form with respect to two variables; it is lin-

ear with respect to the normal n

n when the coefﬁcient

vector α

α is ﬁxed, and vice versa. Accordingly,we use

the ALS algorithm, which sets an initial value for one

variable and then iteratively updates one of the two

variables while the other is ﬁxed in turn, for optimiz-

ing eq.(11).

More speciﬁcally, the normal n

n is updated in a

similar manner to eq.(8) when the coefﬁcient vector

α is ﬁxed, and the coefﬁcient vector α

α is updated in a

similar manner to eq.(10) when the normal n

n is ﬁxed.

In our experiments, we tested two initializations. One

is n

n = (0,0,1)

⊤

, i.e. the normal faces toward a cam-

era. Another is α

α = (1,0,0, ··· , 0)

⊤

, i.e. the spectral

reﬂectance is the same as the ﬁrst principal compo-

nent. We experimentally conﬁrmed that both of the

initializations achieve similar performance, and that

the optimization converges within a few iterations.

It takes about 140 msec to estimate the spectral re-

ﬂectance and normal at each pixel by using MATLAB

on a PC with Core i7.

4.3 Number of Required Images

The point on an object surface is in attached shadow

under a light source, when the angle between the light

VISAPP2015-InternationalConferenceonComputerVisionTheoryandApplications

306

source direction s

s and the normal n

n at the point is

larger than π/2, i.e. s

⊤

n < 0. In general, attached

shadows are inevitably observed on curved surfaces

such as a sphere under varying light source direc-

tions. Since the pixel intensity in attached shadow is

zero, i.e. the left hand side of eq.(3) is 0, we cannot

obtain any constraint about the spectral reﬂectance

and normal from the shadowed pixel except that the

surface normal faces in the opposite direction to the

light source (s

⊤

n < 0)

. Therefore, in order to esti-

mate the spectral reﬂectance and normal per pixel on a

curved surface, we need to take attached shadows into

consideration, and use a sufﬁcient number of images

taken under different light sources so that each point

on the surface is illuminated by the required number

of light sources.

In this study, we derive the number of required im-

ages under the following two assumptions. First, we

assume that the shape of an object of interest is arbi-

trary but convex; denoting the viewing direction by v

we assume arbitrary normals n

n such that v

⊤

n > 0 but

do not take cast shadows

into consideration. Note

that the number of required images could be arbitrar-

ily large, if we assume arbitrary complex shapes such

as a tree with a large number of branches and leaves.

Second, in the numerical analysis below, we assume

that a point on an object surface is illuminated by a

light source, if the inner product between the light

source direction and the normal is larger than a small

threshold ε;

⊤

n > ε. (12)

This is because we detect shadows by using a thresh-

old on pixel values and dark pixels are more likely to

be affected by noise.

Thus, in order to estimate the spectral reﬂectance

and normal at every point on an arbitrary convex sur-

face, the set of color images taken under multispec-

tral and multidirectional light sources has to satisfy

the following conditions.

(A) Each point is illuminated in at least 4 images be-

cause eq.(11) has 10 unknowns in total and each

image yields 3 constraints (10 < 4 × 3).

(B) Each point is illuminated in at least 3 images

taken under different light source spectral dis-

tributions for updating α

α in the ALS algorithm

(8 < 3 × 3).

It is reported that normals can be recovered from at-

tached shadows by using a large number of images taken

under varying light source directions (Okabe et al., 2009).

The cast shadows are observed on concave surfaces,

when s

⊤

n > 0 but the light source is occluded by the other

surface or the other part of the same surface.

taken under different light source directions for

updating n

n in the ALS algorithm.

Based on the assumption about illuminated sur-

face points by using a threshold in eq.(12), it is triv-

ial that at least 3 images taken under different light

source directions are required for illuminating every

point on an arbitrary convex surface at least once (see

Appendix A). Therefore, for satisfying the conditions

(B), 3 images (a triplet) taken under different light

source directions are required for each spectral dis-

tribution, i.e. 9 images (3 triplets) are required in to-

tal. In our experiments, we capture each image by si-

multaneously turning on two light sources at the same

direction but with different spectral distributions so

that the combination of the two spectral distributions

has overlap with the spectral sensitivities of the RGB

channels of a camera.

By using the above 9 images, every point on

an arbitrary convex surface is illuminated by 3 light

sources with different spectral distributions at least

once. Therefore, the condition (C) is satisﬁed, when

the light source directions for the triplets are different

from each other, i.e. when a set of 9 images (a nonu-

plet) is taken under different light source directions.

Moreover, we can numerically show that some of the

nonuplets satisfying the conditions (B) and (C) also

satisfy the condition (A) (see Appendix B). Hence,

we can estimate the spectral reﬂectance and normal at

every point on an arbitrary convex surface from 9 im-

ages. In our experiments, we conﬁrmed that our light

stage has a number of nonuplet candidates which sat-

isfy the conditions (A), (B), and (C).

4.4 Optimizing Light Sources

In the previous subsection, we show that a set of 9 im-

ages (a nonuplet) is required for estimating the spec-

tral reﬂectance and normal at every point on an ar-

bitrary convex surface, and that our light stage has a

number of nonuplet candidates. Since the accuracy of

the estimated spectral reﬂectances and normals could

depend on the nonuplet used for the estimation, we

propose a method for selecting the optimal nonuplet,

in other words, selecting the optimal light sources un-

der which the optimal nonuplet is taken. In particu-

lar, we focus on the optimization of light source di-

rections, since our light sources have only 6 different

spectral distributions in visible range and we have al-

ready used all of them.

The optimization of light source directions is

discussed in the context of the classic photometric

stereo (Drbohlav and Chantler, 2005). They study

how the zero-mean Gaussian noises in pixel intensi-

SimultaneousEstimationofSpectralReflectanceandNormalfromaSmallNumberofImages

307

ties propagate to the normals estimated by using the

pseudo inverse matrix, and show that the noises are

ampliﬁed by



⊤



−1

(13)

through the propagation, where σ is the standard de-

viation of the Gaussian noises and S is a light source

matrix deﬁned in eq.(6) and eq.(7). By minimizing

eq.(13), they ﬁnd the optimal light source conﬁgura-

tions in terms of noise, e.g. 3 orthogonal directions

when the number of light sources is 3. Unfortunately,

however, they ignore attached shadows which are in-

evitably observed on curved surfaces under varying

light source directions. Actually, on curved surfaces,

surface points with different normals could be illumi-

nated by different sets of light sources.

Accordingly,we extend Drbohlav and Chantler by

taking attached shadows into account, and select the

optimal set of light sources from its candidates L in

the minimax manner. Speciﬁcally, we evaluate the

maximum error of the estimated surface normal for

each candidate S = {s

,·· · ,s

} with respect to

arbitrary normals n

n such that v

⊤

n > 0, and then select

the candidate that minimizes the maximum error from

the candidates L;

S = argmin

S∈L

max

⊤

n)S(n

−1

. (14)

Here, S(n

n) consists of light sources included in a can-

didate and illuminating a surface point with normal

⊤

n) = (··· ,s

,·· · ), (15)

where {s

∈ S|s

⊤

n > ε}.

5 EXPERIMENTS

5.1 Multispectral Light Stage

We implemented a multispectral light stage similar

to the existing ones (Ajdin et al., 2012; Gu and Liu,

2012), and captured images under multispectral and

multidirectional light sources on the basis of multi-

plexed sensing (Schechner et al., 2003), which is a

well-known technique for increasing signal-to-noise

ratio without increasing measurement time. In this

study, we used 120 images in total, i.e. the number of

light source directions is 20 (P = 20) and the number

of light source spectral distributions is 6 (Q = 6)

We removed a few images from the 120 images and

used the remaining images for the following analysis and

estimation, because the corresponding light sources did not

work well when those images were captured.

(a)

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

(b)

(i)

(ii)

(iii)

(iv)

(i) (ii)

(iii) (iv)

Figure 2: The estimated spectral reﬂectances (b) at four

points on the plaster relief (a). The red-lines stand for the

straightforward method using 120 images and the green-

dotted lines stand for our proposed method using 9 images.

(c), (d), and (e) show the normals estimated by using the

straightforward method and our method and their difference

respectively. Those results demonstrates that our method

performs as well as the straightforward method.

Figure 1 shows the 120 images of a plaster re-

lief. Here, all the LEDs are calibrated so that they

have the same intensity. The ﬁrst two columns show

the images under purple light, followed by those un-

der blue, green, yellow-green, orange, and red lights.

We can see that the color observed on the surface

changes according to the light source spectral dis-

tribution. Moreover, when we focus on the 20 im-

ages under the same light source spectral distribution,

we can see that the intensity observed on the sur-

face changes according to the light source direction.

Our proposed method estimates both the spectral re-

ﬂectances and normals from the color and shading ob-

served under multispectral and multidirectional light

sources.

In the rest of this section, we ﬁrst conﬁrmed that

our proposed method using a small number of images

performs as well as the straightforward method using

a large number of images. Second, we conﬁrmed that

the use of the optimal set of light sources is effective

for robust estimation. Finally, we evaluated the accu-

racy of our method by comparing the estimated spec-

tral reﬂectances and normals with their ground truth

values.

5.2 Number of Images

Figure 2 (b) shows the estimated spectral reﬂectances

at four points on the plaster relief (a). We can see

that the spectral reﬂectances estimated from 9 images

by using our proposed method (green-dotted lines)

are consistent with those estimated from 120 images

VISAPP2015-InternationalConferenceonComputerVisionTheoryandApplications

308

by using the straightforward method (red-solid lines).

In addition, this result qualitatively demonstrates that

our method can estimate spectral reﬂectances accu-

rately because the plaster relief is made of a uniform

material, and the estimated spectral reﬂectances look

almost the same.

Figure 2 (c), (d), and (e) show the normals esti-

mated from 120 images by using the straightforward

method, those estimated from 9 images by using our

proposed method, and their difference in which the

angle from 0 to π/2 is linearly mapped to 8 bit gray

scale. Here, normals are represented by using a color

map. Speciﬁcally, the x, y, and z elements of a normal

is linearly mapped to R, G, and B channels (see a ref-

erence sphere attached to the normal map). This result

qualitatively demonstrates that our method can esti-

mate normals accurately; e.g. a surface point toward

a camera is bluish and a surface point toward right

is greenish, and that (d) is consistent with (c) except

for concave areas. Note that both the straightforward

method and our method do not necessarily work well

in those areas because they do not take interreﬂection

into consideration.

Table 1 shows the average difference between

the normals estimated by using the straightforward

method and those estimated by using our proposed

method for four different objects; relief, bread,

checker, and ball (see “best” row). This result quan-

titatively demonstrates that our method using a small

number of images performs as well as the straightfor-

ward method using a large number of images because

the differences are small enough.

5.3 Light Source Optimization

As described in subsection 4.4, in general, there are a

number of sets of images (and corresponding sets of

light sources) from which we can estimate spectral re-

ﬂectances and normals of an arbitrary convex surface,

but the accuracy of the estimated spectral reﬂectances

and normals could depend on the set of light sources

used for the estimation. In Figure 3, (b) and (d) show

the images of a wooden bread taken under the opti-

mal, i.e. the best (a) and the worst (c) sets of light

sources derived from eq.(14). Here, we show the light

source directions in the 3D (xyz) space by project-

ing them on the 2D (xy) plane. The inner and outer

circles correspond to the zenith angle θ = π/4 and

θ = π/2 respectively. The selected light sources are

represented by symbols in cyan, and the light sources

represented by the same symbol have the same spec-

tral distribution. We can see that the best set of light

sources distributes at wider angles than the worst one.

Figure 4 (b) shows the estimated spectral re-

(c)

(d)

(a)

(b)

Figure 3: (b) and (d) show the images of the wooden bread

taken under the best (a) and the worst (c) sets of light

sources represented in cyan respectively. The best set of

light sources distributes at wider angles than the worst one.

ﬂectances at four points on the wooden bread (a). (c),

(d), and (f) show the normals estimated from 120 im-

ages, those estimated from the 9 images taken un-

der the best and the worst sets of light sources. (e)

and (g) show the difference between (c) and (d) and

the difference between (c) and (f) respectively. Note

that some artifacts due to specular reﬂection com-

ponents are visible since we assume the Lambertian

model. We can see that spectral reﬂectances and nor-

mals can be estimated from both of the best and the

worst sets of light sources, but that the estimated spec-

tral reﬂectances and normals depend on the set of light

sources used for the estimation. In particular, (e) and

(g) clearly demonstrate that our proposed method per-

forms as well as the straightforward method when the

best set of light sources is used, but does not perform

well when the worst set of light sources is used. This

means that the optimization of light sources is criti-

cally important for robust estimation when the num-

ber of images is small.

Table 1 also shows the difference between the nor-

mals estimated by using the straightforward method

and those estimated by using our proposed method

with the best or the worst set of light sources for the

four objects. This result quantitatively demonstrates

SimultaneousEstimationofSpectralReflectanceandNormalfromaSmallNumberofImages

309

(a)

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

(b)

(i)

(ii)

(iii)

(iv)

(i) (ii)

(iii) (iv)

(c)

(d)

(f)

(e)

(g)

Figure 4: The estimated spectral reﬂectances (b) at four

points on the wooden bread (a). The red-solid lines, green-

dotted lines, and blue-dotted lines stand for the straight-

forward method and our proposed method with the best

and the worst set of light sources. (c), (d), and (f) show

the normals estimated by using the straightforward method

and our method with the best and the worst sets of light

sources. (e) and (g) show the difference between (c) and (d)

and the difference between (c) and (f), and demonstrate that

our method performs as well as the straightforward method

when the best set of light sources is used.

Table 1: The difference between the normals estimated by

using the straightforward method and our proposed method

with the best or the worst set of light sources.

object

relief bread checker ball

best 1.68

◦

3.17

◦

1.05

◦

1.98

◦

worst 3.07

◦

5.42

◦

2.19

◦

4.03

◦

that the optimization of light sources works well be-

cause the difference of the best case is smaller than

that of the worst case.

5.4 Comparison with Ground Truth

First, we compared the spectral reﬂectances of a

color checker estimated by using the straightforward

method and our proposed method with those mea-

sured by using a spectrometer. Figure 5 shows the

image (a) and the estimated spectral reﬂectances (b)

and normals (c)(d)(e) of the color checker. In (b), the

red-solid lines, green-dotted lines, blue-dotted lines,

and magenta-dotted lines stand for the measurement,

the straightforward method, and our proposed method

with the best and the worst sets of light sources. Here,

we could not estimate the spectral reﬂectances and

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

(b)

3 42

(a) (c) (d) (e)

Figure 5: The measured/estimated spectral reﬂectances (b)

at each patch of the color checker (a). The red-solid

lines, green-dotted lines, blue-dotted lines, and magenta-

dotted lines stand for the measurement, the straightforward

method, and our proposed method with the best and the

worst sets of light sources. The estimated spectral re-

ﬂectances are consistent with the measured ones. (c), (d),

and (e) are the normals estimated by using the straightfor-

ward method, and our proposed method with the best and

the worst sets of light sources.

normals at black areas including the top-right color

patch because they were too dark and treated as shad-

ows. Table 2 shows the RMS errors of the spec-

tral reﬂectances from 400 nm to 700 nm

estimated

It is known that the basis functions of spectral reﬂectances

are not necessarily accurate at short-wavelength range (Parkki-

nen et al., 1989). In addition, the measured spectral reﬂectances

are also not accurate in that range because the halogen bulb

used for our experiment is not bright enough.

VISAPP2015-InternationalConferenceonComputerVisionTheoryandApplications

310

Table 2: The RMS errors of the estimated spectral re-

ﬂectances of the color checker; the straightforward method,

our proposed method with the best set of light source, and

that with the worst set of light sources from top to bottom

in each ﬁeld.

row\col.

1 2 3 4

0.054 0.080 0.044 N/A

1 0.059 0.078 0.043 N/A

0.060 0.078 0.044 N/A

0.051 0.075 0.060 0.015

2 0.072 0.070 0.107 0.016

0.070 0.070 0.101 0.015

0.040 0.049 0.123 0.025

0.039 0.057 0.109 0.025

0.039 0.055 0.110 0.024

0.044 0.060 0.088 0.059

4 0.041 0.065 0.080 0.057

0.039 0.063 0.081 0.059

0.068 0.061 0.048 0.063

5 0.081 0.058 0.042 0.070

0.081 0.058 0.041 0.069

0.025 0.068 0.037 0.038

0.030 0.067 0.027 0.052

0.030 0.067 0.027 0.050

by using the straightforward method and by using

our method with the best and the worst sets of light

sources from top to bottom in each ﬁeld. The aver-

ages of the RMS errors are 0.056, 0.058, and 0.058

respectively. This result quantitatively demonstrates

that our method can accurately estimate spectral re-

ﬂectances even from a small number of images.

Second, we evaluated the estimated surface nor-

mals. Figure 6 shows the image (a) and the es-

timated spectral reﬂectances (b) and normals of a

wooden ball. (d), (f), and (h) are the normals esti-

mated by using the straightforward method and our

proposed method with the best and the worst sets of

light sources. (e), (g), and (i) show the differences

between the ground truth (c) and the estimated ones

(d)(f)(h). We assume that the shape of the ball is a

perfect sphere although it looks a little distorted both

locally and globally to some extent. Therefore, a part

of the errors common to the estimated surfacenormals

by using the straightforward method and our method

would be due to those distortions. In addition, we

can observe white spots caused by specular reﬂection

components. The average errors of normals estimated

by using the straightforward method and our method

with the best and the worst sets of light sources are

5.11

◦

, 5.52

◦

, and 7.43

◦

respectively including the de-

viation of the ball from a perfect sphere and the errors

due to specular reﬂection components. Thus, we can

see quantitatively that our method can accurately es-

timate normals even from a small number of images.

(d) (e)

(f) (g)(c)

(h) (i)

(a) (b)

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

(i)

(ii)

(iii)

(iv)

(i) (ii)

(iii) (iv)

Figure 6: The estimated spectral reﬂectances (b) at four

points on the wooden ball (a). The red-solid lines, green-

dotted lines, and blue-dotted lines stand for the straightfor-

ward method and our proposed method with the best and

the worst set of light sources. (d), (f), and (h) are the nor-

mals estimated by using the straightforward method and our

method with the best and the worst sets of light sources. (e),

(g), and (i) show the differences between the ground truth

our method performs as well as the straightforward method

when the best set of light sources is used.

6 APPLICATION

To demonstrate the effectiveness of estimating both

the spectral reﬂectances and normals by using our

proposed method, we synthesized images under novel

illumination conditions. Figure 7 shows the synthe-

sized images of the plaster relief and wooden ball un-

der 9 different illumination conditions; three spectral

distributions times three light source directions. The

spectral reﬂectances and normals estimated by using

the straightforward method (top) and our proposed

method (bottom) are used. We can see that the syn-

thesized images look quite natural, and that the bot-

tom images are consistent with the top images. This

result demonstrates that our proposed method extends

the capability of spectral relighting (Park et al., 2007;

SimultaneousEstimationofSpectralReflectanceandNormalfromaSmallNumberofImages

311

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

0.5

400 500 600 700

Figure 7: The synthesized images under novel illumination

conditions. The spectral reﬂectances and normals estimated

by using the straightforward method (top) and our proposed

method (bottom) are used.

Han et al., 2013) so that one can deal with novel

light source directions. Note that some artifacts due

to specular reﬂection components are observed on

the ball because our method assumes the Lambertian

model.

7 CONCLUSION AND FUTURE

WORK

In this study, by integrating multispectral imaging

and photometric stereo, we proposed a method for si-

multaneously estimating the spectral reﬂectance and

normal per pixel from a small number of images

taken under multispectral and multidirectional light

sources. In addition, taking attached shadows ob-

served on curved surfaces into consideration, we de-

rived the minimum number of images required for the

simultaneous estimation and proposed a method for

selecting the optimal light sources in terms of noise.

Through a number of experiments using real im-

ages, we showed that our proposed method can es-

timate spectral reﬂectances without the ambiguity of

per-pixel unknown scales, and that, when the opti-

mal set of light sources is used, our method using

only 9 images performs as well as the straightforward

method using a large number of images. In addition,

we demonstrated that estimating both the spectral re-

ﬂectances and normals is useful for relighting under

novel light source spectral distributions as well as un-

der novel light source directions.

One direction of future study is the extension to

non-Lambertian surfaces. As mentioned in Section 5

and Section 6, the estimated spectral reﬂectances and

normals are sometimes contaminated by specular re-

ﬂection components since we assume the Lambertian

model. We will use the robust estimation for remov-

ing those components as outliers or model them by

using parametric or non-parametric representation in

the future.

ACKNOWLEDGEMENTS

A part of this work was supported by JSPS KAK-

ENHI Grant No. 25280057.

REFERENCES

Ajdin, B., Finckh, M., Fuchs, C., Hanika, J., and

Lensch, H. (2012). Compressive higher-order sparse

and low-rank acquisition with a hyperspectral light

stage. Technical report, Eberhard Karls Universit¨at

T¨ubingen. WSI-2012-01.

Drbohlav, O. and Chantler, M. (2005). On optimal light

conﬁgurations in photometric stereo. In Proc. IEEE

ICCV2005, volume 2, pages 1707–1712.

Gat, N. (2000). Imaging spectroscopy using tunable ﬁlters:

a review. In Proc. SPIE Wavelet Applications VII, vol-

ume 4056, pages 50–64.

Gu, J. and Liu, C. (2012). Discriminative illumination:

per-pixel classiﬁcation of raw materials based on op-

timal projections of spectral BRDF. In Proc. IEEE

CVPR2012, pages 797–804.

Han, S., Sato, I., Okabe, T., and Sato, Y. (2013). Fast spec-

tral reﬂectance recovery using DLP projector. IJCV.

DOI 10.1007/s11263-013-0687-z.

Okabe, T., Sato, I., and Sato, Y. (2009). Attached shadow

coding: estimating surface normals from shadows un-

der unknown reﬂectance and lighting conditions. In

Proc. IEEE ICCV2009, pages 1693–1700.

Park, J.-I., Lee, M.-H., Grossberg, M., and Nayar, S. (2007).

Multispectral imaging using multiplexed illumination.

In Proc. IEEE ICCV2007, pages 1–8.

Parkkinen, J., Hallikainen, J., and Jaaskelainen, T. (1989).

Characteristic spectra of munsell colors. JOSA A,

6(2):318–322.

Schechner, Y. and Nayar, S. (2002). Generalized mosaicing:

wide ﬁeld of view multispectral imaging. IEEE Trans.

PAMI, 24(10):1334–1348.

Schechner, Y., Nayar, S., and Belhumeur, P. (2003). A

theory of multiplexed illumination. In Proc. IEEE

ICCV2003, pages 808–815.

Wellman, J. (1981). Multispectral mapper: imaging spec-

troscopy as applied to the mapping of earth resources.

In Proc. SPIE Imaging Spectroscopy, volume 268,

pages 64–73.

VISAPP2015-InternationalConferenceonComputerVisionTheoryandApplications

312

Woodham, R. (1980). Photometric method for determin-

ing surface orientation from multiple images. Optical

Engineering, 19(1):139–144.

Yamaguchi, M., Haneishi, H., Fukuda, H., Kishimoto, J.,

Kanazawa, H., Tsuchida, M., Iwama, R., and N, O.

(2006). High-ﬁdelity video and still-image commu-

nication based on spectral information: natural vision

system and its applications. In Proc. SPIE-IS&T Elec-

tronic Imaging, volume 6062.

APPENDIX A

We give a counterexample and prove that 2 images (or

2 light sources) are insufﬁcient for illuminating every

point on an arbitrary convex surface at least once. Let

us consider a unit sphere illuminated by a single direc-

tional light source. The single light source illuminates

at most almost the half of the occluding boundary of

the sphere according to the assumption of illuminated

pixels in eq.(12). Then, the length of illuminated oc-

cluding boundary is (π − δ), where δ is a small num-

ber. Since the length of the entire occluding bound-

ary is 2π and (π − δ) × 2 < 2π, we cannot illuminate

the entire occluding boundary at least once by using

2 light sources.

APPENDIX B

In a similar manner to Appendix A, we give a

counterexample and prove that 8 images (or 8 light

sources) are insufﬁcient for illuminating every point

on an arbitrary convex surface at least 4 times, i.e. for

satisfying the condition (A) in subsection 4.3. Con-

sidering a unit sphere illuminated by a single direc-

tional light source, the length of illuminated occlud-

ing boundary is (π− δ). Since (π− δ)× 8 < 4×2π, it

is clear that we cannot illuminate the entire occluding

boundary at least 4 times by using 8 light sources.

SimultaneousEstimationofSpectralReflectanceandNormalfromaSmallNumberofImages

313