AI Training
With this feature, you can train your chatbot on your website text or PDF documents. Once the training completes successfully, the chatbot can answer questions about your content.
Initiate Chatbot Training
To initiate the chatbot training, input the URL of your content in:
Settings > NuluAI > Training Sources
You can enter URLs for websites, text or PDF files, or XML sitemaps. PDF and text files can also be uploaded from:
Settings > NuluAI > Training Sources > PDF and Text Files
Once the sources are set, click the Train your chatbot button and wait for the training process to complete.
You can add and manage personalized questions and answers from:
Settings > NuluAI > Questions and answers
and from the chatbot training window.
Information
- It is only possible to upload files in PDF and TXT formats.
- You can provide the website URL, and all of its child URLs will be crawled and included. For large websites, an XML sitemap is more efficient than the website URL and less prone to errors and infinite link loops. You can create one with a service like https://www.xml-sitemaps.com.
- If you want to train your chatbot on specific pages of your website instead of all of them, use an XML sitemap. Create one with a tool like https://www.xml-sitemaps.com, then open the file in a text editor and remove the pages you do not want to include. To use the XML sitemap, upload it to your server or to an external online location, then add the URL of the sitemap in:
Settings > NuluAI > Training Sources
- You can upload large files and your XML sitemap with a service like https://tmpfiles.org.
- If you are training NuluAI on a multi-language website, you can restrict the chatbot to retrieving answers only from pages in the user's language. To activate this feature, go to:
Settings > NuluAI > Multilingual Training Sources
For Nulu Business to detect the language of your web pages, the <html> tag must contain the lang attribute.
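The lang requirement above can be checked before training. The following is a minimal sketch that reads the lang attribute from the <html> tag of a page's source; the helper name `html_lang` is hypothetical and is not part of NuluAI or Nulu Business.

```python
from html.parser import HTMLParser


class _LangFinder(HTMLParser):
    """Records the lang attribute of the first <html> tag encountered."""

    def __init__(self):
        super().__init__()
        self.seen_html = False
        self.lang = None

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag == "html" and not self.seen_html:
            self.seen_html = True
            self.lang = dict(attrs).get("lang")


def html_lang(page_source: str):
    """Return the lang value of the <html> tag, or None if it is missing."""
    finder = _LangFinder()
    finder.feed(page_source)
    return finder.lang
```

A page whose source starts with `<html lang="en">` yields `"en"`, while a page with a bare `<html>` tag yields `None` and would need the attribute added before multilingual training can tell its language.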
The uploaded files are deleted as soon as the training completes. Each time you retrain the chatbot, you must upload all the documents needed for that training again.
To add new training sources, simply train the chatbot again. The previous training sources are not lost; only the new sources are added.
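Trimming a sitemap down to specific pages, as described in the list above, can also be scripted instead of done by hand in a text editor. The sketch below assumes a flat sitemap that uses the standard sitemaps.org namespace (not a sitemap index); the function name `filter_sitemap` and the example URLs are hypothetical.

```python
import xml.etree.ElementTree as ET

# Standard namespace of the sitemaps.org protocol (assumed for the input file).
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def filter_sitemap(xml_text: str, keep_prefixes: list[str]) -> str:
    """Keep only the <url> entries whose <loc> starts with one of keep_prefixes."""
    # Serialize with a default namespace instead of ns0: prefixes.
    ET.register_namespace("", SITEMAP_NS)
    root = ET.fromstring(xml_text)
    prefixes = tuple(keep_prefixes)
    # Iterate over a copy because entries are removed from root in the loop.
    for url in list(root.findall(f"{{{SITEMAP_NS}}}url")):
        loc = url.findtext(f"{{{SITEMAP_NS}}}loc", default="").strip()
        if not loc.startswith(prefixes):
            root.remove(url)
    return ET.tostring(root, encoding="unicode")


example = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/docs/setup</loc></url>
  <url><loc>https://example.com/blog/news</loc></url>
</urlset>"""

trimmed = filter_sitemap(example, ["https://example.com/docs/"])
```

Here `trimmed` keeps the `/docs/setup` entry and drops the `/blog/news` entry. The resulting file still has to be uploaded to a reachable location and its URL added as a training source.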
Automatic Training Sources
- The articles are used as training sources automatically.
- The conversations are used as training sources automatically. The training runs via a cron job every 24 hours. Only user and agent messages are used; chatbot messages are ignored.
Important Notes
- There are character limits for training the chatbot. You can view the character limits [here](link to character limits).
- The embedding model is essential for training your chatbot and handling all user messages. We currently use the text-embedding-3-small model. It is required for these scenarios and cannot be disabled or changed.
- Responses generated by NuluAI can include a link to the website page from which the answer was sourced.
- Click Delete training to remove all previous training data for the chatbot.
- The embeddings are stored as JSON files in the Nulu Business uploads folder and are secured using the password-by-filename approach.
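The document does not spell out how the password-by-filename approach works. As an illustration only, the sketch below assumes it means that each file gets a long random name that cannot be guessed, so the name itself acts as the secret. The function name, prefix, and token length are hypothetical and do not describe the actual Nulu Business implementation.

```python
import secrets


def secret_filename(prefix: str = "embeddings", token_bytes: int = 32) -> str:
    """Build a JSON filename containing an unguessable random token.

    Assumption: the file is only reachable by someone who already knows
    its full name, so the random token plays the role of a password.
    """
    # token_hex(n) returns 2 * n hexadecimal characters from a secure source.
    return f"{prefix}-{secrets.token_hex(token_bytes)}.json"
```

This kind of scheme only helps if the folder does not expose a directory listing, since a listing would reveal the filenames.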