5 CONCLUSIONS
Summarizing all the results, we can conclude the
following:
In order to exclude errors in NLP outcomes
caused by CoreNLP parser, the source text
must avoid verb forms syntactically equals to
noun forms.
Results of manual processing can be more
complete, because of expert’s ability to add
implicit knowledge, but differ from the written
text due to expert’s ability to modify actions
and events ad hoc as well as to find synonyms
in the text.
Structural relations between extracted domain
objects differ in two approaches. In case of
NLP it depends on completeness of data in the
sentence. In case of manual processing it
depends on expert’s knowledge about the
domain, thus, on implicit knowledge.
Incomplete knowledge can be extracted,
because sentences lack information on
who/what is the subject when verbs are in the
active voice; however, if after processing this
knowledge is absent, then text can be
supplemented with necessary information.
Incomplete information may lead to
identification of more abstract or more specific
functional features for one and the same
functional characteristic.
A sentence can contain information on both
executor and recipient. The recipient may be
another actor. Now, this actor is specified as an
object of action A. However, there are cases
when it is worth to specify them separately.
Therefore, processing of textual descriptions even in
formal style have issues related to the technical side
(i.e., parsing models and outcome representation
formats) and to particularities of the natural language
(textual description may not have all needed
knowledge, structures of sentences differ, implicit
synonyms are used). The latter can be partially solved
either by using machine learning or manual pre-
processing of knowledge, e.g., specification of use
case scenarios or user stories and exhaustive software
requirements.
Future research expects refinement of the
proposed steps to decrease ambiguity in the
processing results: to find patterns of sentences that
minimize arbitrary interpretations of structural
relations between domain objects, to elaborate more
specific separation of domain objects involved in
actions, as well as identify cause-and-effect relations
between functional characteristics of the system.
REFERENCES
Amardeilh, F., Laublet, P. and Minel, J.-L., 2005.
Document annotation and ontology population from
linguistic extractions. In Proceedings of the 3rd
international conference on Knowledge capture - K-
CAP ’05. New York, New York, USA: ACM Press, pp.
161–168.
Angeli, G., Johnson Premkumar, M. and Manning, C.D.,
2015. Leveraging Linguistic Structure For Open
Domain Information Extraction. In Proceedings of the
Association of Computational Linguistics (ACL).
Beijing, China: Association for Computational
Linguistics, pp. 344–354.
Asnina, E. and Osis, J., 2010. Computation Independent
Models: Bridging Problem and Solution Domains. In
Proceedings of the 2nd International Workshop on
Model-Driven Architecture and Modeling Theory-
Driven Development. Lisbon: SciTePress - Science and
and Technology Publications, pp. 23–32.
Asnina, E. and Osis, J., 2011. Topological Functioning
Model as a CIM-Business Model. In Model-Driven
Domain Analysis and Software Development. Hershey,
PA: IGI Global, pp. 40–64.
Asnina, E. and Ovchinnikova, V., 2015. Specification of
decision-making and control flow branching in
Topological Functioning Models of systems. In ENASE
2015 - Proceedings of the 10th International
Conference on Evaluation of Novel Approaches to
Software Engineering.
Bhala, V., Vidya Sagar, R. and Abirami, S., 2014.
Conceptual modeling of natural language functional
requirements. The Journal of Systems and Software, 88,
pp.25–41.
Bies, A. et al., 1995. Bracketing Guidelines for Treebank II
Style, Available at: https://repository.upenn.edu/
cis_reports/index.4.html.
Donins, U., 2012. Semantics of Logical Relations in
Topological Functioning Model. In Proceedings of the
7th International Conference on Evaluation of Novel
Approaches to Software Engineering, Wrocław,
Poland, 29-30 June, 2012. SciTePress, pp. 217–223.
Elbendak, M., Vickers, P. and Rossiter, N., 2011. Parsed
use case descriptions as a basis for object-oriented class
model generation. Journal of Systems and Software,
84(7), pp.1209–1223.
Elstermann, M. and Heuser, T., 2016. Automatic Tool
Support Possibilities for the Text-Based S-BPM
Process Modelling Methodology. In Proceedings of the
8th International Conference on Subject-oriented
Business Process Management - S-BPM ’16. New
York, New York, USA: ACM Press, pp. 1–8.
Friedrich, F., Mendling, J. and Puhlmann, F., 2011. Process
Model Generation from Natural Language Text. In
Proceedings of the 23rd International Conference on
Extracting Core Elements of TFM Functional Characteristics from Stanford CoreNLP Application Outcomes
601