Skip to content

Developing, Implementing, and Testing Reproducible AI Chatbots in Research and Educational Applications

Poster:

Project Poster Poster

Research Goal:

Create an custom ChatGPT 4o chatbot trained on any website. Implement my chatbot on the CyVerse Learning Materials websites

Introduction

CyVerse is a powerful computational infrastructure for researchers & educators

AI models have grown in popularity, efficiency, and capability with the rise of GPTs and LLMs

Researchers want access to AI chatbots trained on their datasets. These will yield faster, more accurate data-driven insights.

Following Open Science best-practices requires transparency and reproducibility through open source software, versus commercial software, like what OpenAI sells.

Creating detailed documentation helps enhance understanding, collaboration, and reproducibility of scientific research.

Discussion

Accuracy is verified by citations and hyperlinks of all sources

Chatbots enable users to quickly receive accurate responses

Each Vector Store (knowledge base) can be automatically updated with current information

Assistant also has access to the entirety of the internet to gather information

Implementation Cost: hosting a Virtual Machine

Running Cost: proportional to amount and size of messages

Message (token) limits are necessary to prevent overspending

ChatGPT streamlines the development process – it helps write code and resolve errors.

Chatbot
💬