Chomp Sci: The Computer Science Chatbot

date:	May 2023
team:	5
tech:	Figma, React, JSX, HTML, CSS, Docker, RASA
link:	Chomp Sci GitHub Repository

Introduction

Chomp Sci is a retrieval based chat bot with a web-based user interface that was developed as part of my senior project for my BS in Computer Science from the University of Florida. The chatbot uses RASA to infer the intent of user queries and respond with an appropriate pre-programmed response.

A test image — Chomp Sci use case illustration.

The figure below details the system architecture used by Chomp Sci. I built a React based chat widget which is embedded on a web page. The user interacts with the chat widget, and messages are exchanged between the front-end and back-end via Socket.IO. RASA Core and its Natural Language Understanding unit infers the intent of user input and either responds with a matching response or passes the query via HTTP to the RASA Action Server. The RASA Action Server allows programmatic actions to be taken in response to user input such as running a Python function to query a database.

My Contributions

Frontend Design

I worked on the frontend design of the Chomp Sci project page and chat widget in Figma. I created the Chomp Sci hero concept and high fidelity prototypes and assets for the chat widget.

React Chat Widget Implementation

I implemented the React based chat widget for Chomp Sci. This consisted of writing code to:

integrate the Socket.IO client with the chat widget
send messages via the Socket.IO client to the RASA Core server
listen for responses from the RASA Core server via side effects
manage the state of the chat widget:
- chat window open/ closed status
- chat window message log
process and present chat window message log entries:
- support for different message data formats sent by RASA, including simple strings, images, and quick link buttons
- support for markdown in responses via Remark
automate the testing of the chat widget's sub-components using React Testing Library

Chatbot Content and Training Data

I contributed training input for the RASA Natural Language Understanding Unit and pre-programmed responses for the following topics:

the campus vs. online experience at UF
computer science overview and history
miscellaneous chit-chat content
support for course name and prefix synonyms

Project Context and Relevance

The initial proposal for Chomp Sci was submitted in the Fall of 2022, before OpenAI had granted public access to ChatGPT. Large Language Models went viral during development of Chomp Sci and it was interesting to see how chat bot technology changed while working on the project.

While RASA's Natural Language Understanding unit utilizes AI to determine the intent of user input, that intent is matched to pre-programmed responses. This is in contrast to generative AI which not only processes user input but also generates a predictive response. The benefit of generative AI is that chatbots for the first time are able to convincingly simulate conversation. The drawback of this approach is that while LLMs generally produce feasible responses, they are not always appropriate ones. Microsoft Tay is an infamous example of how generative AI can go wrong.

RASA provides a framework for a hybrid approach to chatbot development, where recent advances in AI are leveraged to make the processing of input easier while leaving the authoring of responses to humans. Such a system is not good at simulating open-ended conversation, however it is a good tool for transactional conversations and avoiding inappropriate or inaccurate responses. Approaches to leveraging generative AI are currently changing at a rapid pace. Better curation of training data to improve alignment or multi-agent systems to incorporate AI powered checks and balances into products are likely to make Large Language Models suitable for use in a wider variety of contexts.

Back to portfolio.