7 FUTURE DIRECTIONS
The primary direction for future research is in val-
idating and refining the architecture with other use
cases. Since the original BIW tool was deployed in
a wide-ranging set of domains, from heath-care and
drug discovery to blogs and patent databases, we ide-
ally would like to be able to add one example per
domain which would helpfully validate our APIs and
services. Such exercises would also allow us to refac-
tor and better structure the different API layers. We
would expect more abstractions of common interfaces
and also further aggregation and specialization of the
domain-specific portions. In the end, we expect to
evolve our APIs into common and domain-specific
text mining ‘languages’.
Another direction of our research is in improving
our pricing models. Currently our metering infras-
tructure captures some basic usage data. This en-
abled us to create an initial pricing model to charge
customers of BISON on frequency of usage, i.e., fre-
quency of API calls. However, this pricing model is
limited, in particular, for most domain applications of
text analytics and text mining, the end-user actions are
very iterative and repetitive. This means that a model
that measures frequency of usage may not accurately
reflect the value the end-user got from the services.
Instead of simple frequency usage measurements we
need to have means for measuring usage patterns as
well as meter the results from the service calls.
In addition, coarser-grained pricing models, such
as a subscription pay-as-you go or a per-user-session
pay-as-you-use model may be more appropriate for
many of our use cases. To that end, we have made
the pricing model component flexible and pluggable.
This will allow us to experiment with various models
for different clients and domains.
Finally, another important direction is in the user
interfaces and user experience aspects of the entire
BISON stack. In particular we want to create an ini-
tial set of reusable UI views that we can use to as-
semble new solutions. Also, since in general, as men-
tioned before, the process of mining for insights is
very iterative and repetitive, it would be beneficial if
the end-users are able to collaborate and provide feed-
back to the tool; for example, enabling the ability to
tag particular documents in a result set or the abil-
ity to rate documents, entities, and other aspects of
a BISON application. By aggregating the feedback
we could improve the experience of users of common
data sources, e.g., enterprise users mining enterprise
data. The BISON engine could also use this aggre-
gated feedback to improve some of the clustering and
mining operations.
REFERENCES
Agrawal, R. (1999). Data Mining: Crossing the Chasm.
In Proceedings of 5th ACM SIGKDD International
Conference on Knowledge Discovery and Data Min-
ing (KDD-99), San Diego, CA.
Arsanjani, A. (2005). Service-Oriented Modeling and Ar-
chitecture. Technical report, IBM Global Services.
Bass, L., Clements, P., and Kazman, R. (1998). Software
Architecture in Practice. Addison-Wesley, Boston,
MA.
Beck, K. and Andres, C. (2005). eXtreme Programming
Explained: Embrace Change, 2nd Edition. Addison-
Wesley, Boston, MA.
Cheung, W., Zhang, X., Wong, H., Liu, J., Luo, Z., and
Tong, F. (2006). Service-Oriented Distributed Data
Mining. IEEE Internet Computing, 4(10):44–54.
Codd, E. F., Codd, S. B., and Salley, C. T. (1998). Providing
OLAP to User-Analyts: An IT Mandate. Technical
report, E.F. Codd Associates.
Cody, W., Kreulen, J. T., Krishna, V., and Spangler, W. S.
(2002). The Integration of Business Intelligence
and Knowledge Management. IBM Systems Journal,
4(41).
Curbera, F., Goland, Y., Klein, J., Leymann, F.,
Roller, D., Thatte, S., and Weerawarana, S.
(2002). Business Process Execution Lan-
guage for Web Services, Version 1.0. www-
128.ibm.com/developerworks/library/specification/ws-
bpel/.
Exaltec (2006). Exaltec’s b+ J2EE-SOA Application Gen-
erator. www.exaltec.com/appgenerator.html.
Fowler, M., Beck, K., Brant, J., Opdyke, W., and Roberts,
D. (1999). Refactoring: Improving the Design of Ex-
isting Code. Addison-Wesley, Boston, MA.
Gamma, E., Helm, R., Johnson, R., and Vlissides, J.
(1995). Design Patterns: Elements of Reusable
Object-Oriented Software. Addison-Wesley, Reading,
MA.
Guedes, D., Meira, W. J., and Ferreira, R. (2006).
Anteater: A Service-Oriented Architecture for High-
Performance Data Mining. IEEE Internet Computing,
4(10):36–43.
Hempel, J. and Lehman, P. (2005). The MySpace Genera-
tion. Technical report, Business Week Online.
Kumar, A., Kantardzik, M., and Madden, S. (2006). Dis-
tributed Data Mining: Frameworks and Implementa-
tions. IEEE Internet Computing, 4(10):15–18.
Sonic (2006). Sonic SOA Workbench.
www.sonicsoftware.com/products/sonic
workbench.
Spangler, W. S., Cody, W., Kreulen, J. T., and Krishna,
V. (2003). Generating and Browsing Multiple Tax-
onomies over a Document Collection. Journal of
Management and Information Systems, 4(19):191–
212.
ICEIS 2007 - International Conference on Enterprise Information Systems
588