Discovery of Logical Structure in a MultisensorEnvironment Based on Sparse Events
AI*IA 2003 Workshop on Ambient Intelligence,8th National Congress of Italian Association for Artiﬁcial Intelligence
J. Trindler
1
,
3
, R. Zwiker
1
,
3
, U. Rutishauser
2
,
3
, J. Joller
1
, R. Douglas
3
{
trindlerrzwikerurutjojorjd
}
@ini.phys.ethz.ch
1
University of Applied Sciences Rapperswil, Switzerland
2
California Institute of Technology, Pasadena, USA
3
Institute of Neuroinformatics, ETH and University of Zurich, Switzerland
Abstract.
Several projects [BDY99,CC00,Bro97] attempt to make abuilding intelligent by controlling eﬀectors and handle sensors with functional learning. Most of them do not consider the structure as dynamic.We demonstrate an approach to discover online the functional structureof a multisensor environment. It is applied to a real commercial oﬃcebuilding where people do normal work. We determine relationships between sensors with a hebbian learning approach and prove that it ispossible to discover the physical structure almost completely.
1 Introduction
Modern commercial oﬃce buildings are typically equipped with a large numberof sensors (e.g. presence, temperature, illumination, humidity) and actors (e.g.lights, window blinds, wallswitches) which are connected to a common bus.The conﬁguration of the sensors and eﬀectors is usually explicitly speciﬁed by ahuman building manager (e.g. which sensor aﬀects which eﬀector).Modern approaches to the architecture of living and working environmentsemphasize the simple reconﬁguration of space to meet the needs, comfort andpreferences of its inhabitants and to minimize the consumption of resources suchas power. The ﬁnal goal of such an approach is to make a building intelligent.An intelligent building constantly adapts itself by learning from its users andtakes actions to control the eﬀectors of the building. By doing so it graduallylearns what a users behavior is and adapts to it.There are two fundamentally diﬀerent forms of learning involved in such asystem. It must i) learn about the behavior of its users (functional learning)and it must ii) learn how the devices it gets data from and controls are relatedrelative to each other (structural learning). We demonstrate and analyze functional learning in intelligent buildings in [RSDJ04] whereas in this paper weconcentrate on the aspect of structural learning.Adaptive control and functional learning works on the basis of functionalrelated sensors and eﬀectors. It is therefore essential to cluster the sensors and
2
eﬀectors of such a building into logical groups to control it in a ﬂexible andmanageable way. A well deﬁned structure is a fundamental basis for furtherfunctional learning.In this paper we describe and demonstrate a learning algorithm that dynamically discovers and hierarchically clusters functional related sensors and eﬀectors.We regard an intelligent building as a distributed multisensor environment. Thephysical constraints of such a building allow it to discover its functional structureonline by analyzing the data acquired during normal operation of the building.We make no domain speciﬁc assumptions apart from the fact that we assumethat if a state of a local area changes (e.g. when a person enters a room) thereare several sensors/eﬀectors that react to this event. We prove that this is areasonable and suﬃcient assumption. This algorithm can thus be applied to anarbitrary multisensor environment of similar properties.On one hand the discovered structure should be selforganizing and dynamicto be able to handle new or mobile devices. But at the same time it should alsobe more or less stable once it is learned as the functional structure of a building usually doesn’t change very often. This poses a typical stability/plasticitydilemma [Gro99]. An other aspect we analyze is whether the discovered structurematches a structure with properties of physical building constraints.
748575PresenceLightBlindsOutside SensorsLonWorksGatewayIPMASKey78 80 84 86 8879 872628
Fig.1.
Architecture of the test environment from which data were acquired.
1.1 Data Acquisition
To realistically demonstrate the performance of our approach we use data acquired from a real building. The data were acquired from a commercial oﬃcebuilding that is equipped with sensors and eﬀectors as shown in Fig. 1. Data wasacquired from 12 rooms located on the same ﬂoor with a total number of 290
3
sensors/eﬀectors, out of which we use 5 rooms and 26 devices for the analysisin this paper. All rooms are used daily. Data was recorded during a period of 4months. All data recording is event based. Every recorded event thus representsa change of state.One aspect that makes learning from such data diﬃcult is that they are verysparse. Even by recording for several months one only gets very few events fromserveral devices. This presents an additional challenge for using such data as abasis for learning (see last column of Table 1).
2 Approach to Discover Functional Structure
In order to determine the relationships between devices of a commercial buildingwe develop a model which represents correlations between device combinations.We regard the multisensor environment as a network of neurons. Neurons areconnected to each other by synapses. We represent this model with a graphconsisting of
n
nodes and
n
·
(
n
−
1) directed edges. Each edge represents asynapse and therefore with its weight the relationship between two nodes. Inour model a node represents a physical sensor or eﬀector.
2.1 Relationship Detection
To be able to ﬁnd local groups we have to modify the weight of each edge. Ouranalysis [TZDJ03] has shown that most information of the relationships betweennodes can be extracted from the relative timing of pre and postsynaptic actionsof a node. Fig. 2 shows an example of relative times inside and between rooms.
151010000.020.040.060.080.10.120.140.160.18inside room 86
f r e q u e n c y
relative time [s]151010000.020.040.060.080.10.120.140.160.18between room 86 and 75
f r e q u e n c y
relative time [s]
relative timing analysis A) B)
Fig.2.
A) Relative timing distribution of all event correlations inside room 86 and B)between room 86 and 75. The relative time axes are log scales. Both ﬁgures containsequences where nodes are active within 200s after another.
4
We prove our assumption that the relative timing between two updates fromdevices inside the same rooms is smaller and more evident than between devicesfrom diﬀerent rooms. With these results and inspired by the hebbian learningprinciple [Heb49] we potentiate only small relative timing sequences and notlong term sequences. We are strengthening correlations between devices basedon an exponential modiﬁcation function (1) to calculate the new weight
∆r
.
R
(
∆t
) =
me
∆t
−
tgaps
(1)where
∆t
is the relative time
∆t
=
t
i
+1
−
t
i
,
m
the modiﬁcation constant and
s
a scaling constant. Because of transmission delays of the ﬁeldbus we deﬁne asmall gap
t
gap
during which all sequences get modiﬁed with the maximum value(no exponential decay).Fig. 3 shows this modiﬁcation function which is applied within an asymmetriclearning window. Experiments by Song et al. [SMA00] demonstrated the feasibility of an exponential synaptic reward function by using spike timingdependentplasticity. The shape of the modiﬁcation function is a good approximation of theanalyzed data.
00.511.522.53x 10
4
00.10.20.30.40.50.60.70.80.91negative exponential modification function maxmodification=1.0 gap=1s s=5s
m o d i f i c a t i o n
relative time [ms]
Fig.3.
This modiﬁcation function is applied to modify correlations between devicesbased on their temporal activity.
By using an asymmetric learning window we can also obtain the causualrelationship between the devices. To determine the dynamic behavior we assumeeach node has to compete for its relationship to other nodes. A decay function,which decreases relations from a node everytime it is ﬁring, bounds the range of weight and makes it competitive.
5
2.2 Cluster Partitioning of Relationship Model
The aim of our approach is to have clusters constituted of functional related devices. Our algorithm determines the relationship between devices and representsthese as a weighted directed graph from which the clusters need to be extracted.To get several clusters, possibly even hierarchically dependent, the graphhas to be partitioned. Shi and Malik [SM00] prose a normalized cut algorithmwhich is mainly used in computer vision to segment images by trying to extractthe global impression. The clustering of our relationship model can be seen asa graph partitioning problem where local subgraphs have to be found. Normalminimized cut algorithms would start the partitioning with the most inactivedevices independently. In our case we want them also clustered with others. Thenormalized cut algorithm has an other criteria to determine the minimum cutof a graph
G
= (
V,E
) which partitions nodes
V
into two subsets
A
and
B
:
Ncut
(
A,B
) =
cut
(
A,B
)
assoc
(
A,V
) +
cut
(
A,B
)
assoc
(
B,V
) (2)where
assoc
(
A,V
) =
u
∈
A,v
∈
V
w
(
u,v
) is the total connection weight fromnodes in A to all nodes
V
in the graph and the same for
assoc
(
B,V
) with nodesin B. The computation of an optimal Ncut itself is NPcomplete, but Shi andMalik proved that it can be approximated by solving a generalized eigenvaluesystem:(
D
−
W
)
y
=
λDy
(3)where
d
(
i
) =
j
w
(
i,j
) and
D
is a
n
x
n
matrix with
d
on its diagonal.
W
is asymmetric
n
x
n
matrix with each individual edge weight. Shi and Malik provedthat the eigenvector
y
of the second smallest eigenvalue
λ
is a good approximation criteria to partition nodes into two subsets. Each element of vector
y
belongs to subset
A
if
y
i
>
0 and otherwise to subset
B
.The two subgraphs can again be recursivly partitioned by solving the eigenvalue system for each subgraph itself and this until Ncut reaches a threshold. Theresult is a hierarchical binary tree which has more or less independent clusterson its leaves. These clusters represents the functional structure.
3 Results and Discussion
After applying our algorithm to data from a real working environment and afterclustering our results into several groups we analyze the obtained results. Wealso compare our results with the physical structure.To determine the quality of our functional structure we have to ﬁnd a way toquantify the similarity of a cluster with the physical model. We therefore allocateeach cluster to the physical room, based on the number of devices. This can beachieved by calculating the maximum clarity
c
for all rooms. The clarity
c
of acluster
P
(where

P

is the power of set
P
) compared with a speciﬁc room
R
isdeﬁned as