5 CONCLUSIONS AND FUTURE
WORK
In this work we proposed a weak-supervised approach
to the ASC task. We developed a method that exploits
the data programming paradigm in order to address
the problem. We tested the method on two disjoint
domains, laptops and restaurants. We tuned a Snorkel
generative model for each domain and used them to
label the data in a probabilistic manner. Resulting
probabilistic training sets were used to fine-tune dis-
criminative models proposed in (Xu et al., 2019) that
were previously post-trained with unlabeled domain
data.
The experiments we carried out offer many hints
for future work. In particular, obtained results sug-
gest that the approach can be used cross-domain. We
plan to perform more experiments on larger datasets
in order to confirm initial intuitions. Our future work
will be focused on real world datasets to perform ex-
tensive experiments on scalability and performances
of the method we have proposed in this paper. More-
over, we will improve the method by defining further
LFs templates while maintaining its simplicity.
REFERENCES
Bach, S. H., Rodriguez, D., Liu, Y., Luo, C., Shao, H.,
Xia, C., Sen, S., Ratner, A., Hancock, B., Alborzi, H.,
Kuchhal, R., R
´
e, C., and Malkin, R. (2019). Snorkel
drybell: A case study in deploying weak supervision
at industrial scale. In Boncz, P. A., Manegold, S.,
Ailamaki, A., Deshpande, A., and Kraska, T., edi-
tors, Proceedings of the 2019 International Confer-
ence on Management of Data, SIGMOD Conference
2019, Amsterdam, The Netherlands, June 30 - July 5,
2019., pages 362–375. ACM.
Bird, S. (2006). NLTK: the natural language toolkit. In
Calzolari, N., Cardie, C., and Isabelle, P., editors,
ACL 2006, 21st International Conference on Compu-
tational Linguistics and 44th Annual Meeting of the
Association for Computational Linguistics, Proceed-
ings of the Conference, Sydney, Australia, 17-21 July
2006. The Association for Computer Linguistics.
Devlin, J., Chang, M., Lee, K., and Toutanova, K. (2019).
BERT: pre-training of deep bidirectional transformers
for language understanding. In Burstein, J., Doran,
C., and Solorio, T., editors, Proceedings of the 2019
Conference of the North American Chapter of the As-
sociation for Computational Linguistics: Human Lan-
guage Technologies, NAACL-HLT 2019, Minneapolis,
MN, USA, June 2-7, 2019, Volume 1 (Long and Short
Papers), pages 4171–4186. Association for Computa-
tional Linguistics.
Ding, Y., Yu, J., and Jiang, J. (2017). Recurrent neural net-
works with auxiliary labels for cross-domain opinion
target extraction. In Proceedings of the Thirty-First
AAAI Conference on Artificial Intelligence, February
4-9, 2017, San Francisco, California, USA., pages
3436–3442.
Hu, M. and Liu, B. (2004). Mining and summarizing cus-
tomer reviews. In Kim, W., Kohavi, R., Gehrke, J.,
and DuMouchel, W., editors, Proceedings of the Tenth
ACM SIGKDD International Conference on Knowl-
edge Discovery and Data Mining, Seattle, Washing-
ton, USA, August 22-25, 2004, pages 168–177. ACM.
Krizhevsky, A., Sutskever, I., and Hinton, G. E. (2012). Im-
agenet classification with deep convolutional neural
networks. In Bartlett, P. L., Pereira, F. C. N., Burges,
C. J. C., Bottou, L., and Weinberger, K. Q., edi-
tors, Advances in Neural Information Processing Sys-
tems 25: 26th Annual Conference on Neural Informa-
tion Processing Systems 2012. Proceedings of a meet-
ing held December 3-6, 2012, Lake Tahoe, Nevada,
United States., pages 1106–1114.
Manning, C. D., Surdeanu, M., Bauer, J., Finkel, J. R.,
Bethard, S., and McClosky, D. (2014). The stanford
corenlp natural language processing toolkit. In Pro-
ceedings of the 52nd Annual Meeting of the Associa-
tion for Computational Linguistics, ACL 2014, June
22-27, 2014, Baltimore, MD, USA, System Demon-
strations, pages 55–60. The Association for Computer
Linguistics.
Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., and
Dean, J. (2013). Distributed representations of words
and phrases and their compositionality. In Burges, C.
J. C., Bottou, L., Ghahramani, Z., and Weinberger,
K. Q., editors, Advances in Neural Information Pro-
cessing Systems 26: 27th Annual Conference on Neu-
ral Information Processing Systems 2013. Proceed-
ings of a meeting held December 5-8, 2013, Lake
Tahoe, Nevada, United States., pages 3111–3119.
Pablos, A. G., Cuadros, M., and Rigau, G. (2014). V3: un-
supervised generation of domain aspect terms for as-
pect based sentiment analysis. In Nakov, P. and Zesch,
T., editors, Proceedings of the 8th International Work-
shop on Semantic Evaluation, SemEval@COLING
2014, Dublin, Ireland, August 23-24, 2014., pages
833–837. The Association for Computer Linguistics.
Pablos, A. G., Cuadros, M., and Rigau, G. (2015). V3:
unsupervised aspect based sentiment analysis for se-
meval2015 task 12. In Cer, D. M., Jurgens, D.,
Nakov, P., and Zesch, T., editors, Proceedings of
the 9th International Workshop on Semantic Eval-
uation, SemEval@NAACL-HLT 2015, Denver, Col-
orado, USA, June 4-5, 2015, pages 714–718. The As-
sociation for Computer Linguistics.
Pablos, A. G., Cuadros, M., and Rigau, G. (2018).
W2VLDA: almost unsupervised system for aspect
based sentiment analysis. Expert Syst. Appl., 91:127–
137.
Pereg, O., Korat, D., Wasserblat, M., Mamou, J., and Da-
gan, I. (2019). Absapp: A portable weakly-supervised
aspect-based sentiment extraction system. CoRR,
abs/1909.05608.
Pontiki, M., Galanis, D., Pavlopoulos, J., Papageorgiou,
H., Androutsopoulos, I., and Manandhar, S. (2014).
A Weak-supervision Method for Automating Training Set Creation in Multi-domain Aspect Sentiment Classification
255