surrounding context into the answer generation
process.
• Language Barrier: Even though LLMs are fine-
tuned on multiple languages, they give their best
performance on English language. We Solved this
issue by using Google Translate to translate from
Indic languages into English and then generating
the answer from LLM.
• Hallucination: LLM’s native behavior is to
synthesis the answer if the answer is not known to
it, which produces wrong information. We are
therefore using Retrieval Augmented Generation
(RAG) technique, to search answers first in the
VectorStore of ancient text and we generate that
context and then we give Query and Context to
generate answers.
• Proof of Correctness: To ensure the correctness of
the answers, we have labelled the paragraphs of
ancient texts. This labelling facilitates the effective
retrieval of answers and allows us to display the
source of the information.
While the Vivechan system has shown significant
capabilities responding to queries related to ancient
Hindu scriptures, there are some limitations of the
system as per our observations.
• Language Barrier: The translation of Hindu
scriptures from Indic languages to English creates
limitation. Many of these texts carry nuanced
meanings that are deeply embedded in the cultural
contexts of their original languages. During
translation, some of these nuances may not be fully
captured leading to inaccurate answers.
• Spelling errors within the queries can significantly
impact its performance.
• The effectiveness of Vivechan is highly dependent
on several hyperparameters which include the
choice of tokenizer, the specific LLM used for
generating answers, the library utilized for
searching context, and the size of the context
window. For example, consider third example in
Table 2. Due to incorrect context retrieval, the
system gives incorrect answer. It identifies
Ashwatthama as son of Bhishma Pitamah which is
incorrect. As per Mahabharat, Ashwatthama was
son of Drona.
6
Online link: https://vivechan.streamlit.app/
8 CONCLUSIONS
The Vivechan project represents a significant
advancement in the domain of AI-driven question-
answering systems in the exploration of ancient
Indian texts and spiritual wisdom. Through the
integration of technologies such as LLMs, FAISS and
multilingual support, Vivechan offers users a
powerful tool for exploring the depths of spiritual
knowledge. Vivechan shows AI's ability to bridge the
knowledge gap between traditional wisdom and
modern technology, allowing a more appreciation of
spiritual teachings in the digital era. The system is
deployed and available for the public use
6
.
Disclosure of Interests. The authors declare that there is no
conflict of interests.
ACKNOWLEDGEMENTS
We acknowledge Google Colab for providing the
necessary GPU support for our model training and
experimentation processes. Special thanks to various
web resources and digital libraries that have made
Indian texts such as Shiv Puran, Ramayan, and
Bhagavad Gita accessible online. These resources
have been very useful for the preparation and
enrichment of our dataset.
REFERENCES
1. “Ramayangpt.”
https://github.com/abhiamishra/RamayanaGPT.
2. H. Thapliyal, “Unveiling the past: Ai-powered
historical book question answering,” Global journal of
Business and Integral Security, 2016.
3. A. Karekar, S. Limaye, A. Nara, and S. Panchal,
“Bhagavad geeta based chatbot,” in 2023 3rd
International Conference on Intelligent Technologies
(CONIT), pp. 1– 6, 2023.
4. P. P. Ray, “Chatgpt: A comprehensive review on
background, applications, key challenges, bias, ethics,
limitations and future scope,” Internet of Things and
CyberPhysical Systems, vol. 3, pp. 121–154, 2023.
5. A. Chowdhery, S. Narang, J. Devlin, M. Bosma, G.
Mishra, A. Roberts, P. Barham, H. W. Chung, C.
Sutton, S. Gehrmann, et al., “Palm: Scaling language
modeling with pathways,” Journal of Machine
Learning Research, vol. 24, no. 240, pp. 1–113, 2023.