Components in RASA NLU

ByAngela Marpaung June 8, 2020December 31, 2023

In RASA, user messages is excecuted for every sequence of components. All components executed in RASA can be customized to meet any requirements in pipeline defined in config.yml file. We can even build our own (custom) component in RASA NLU.

Configurating the Right Components

Every components have different functions whether its for pre-processing text, intent classification, entity extraction, feature engineering, etc. A pipeline usually consist of three main function of components:

Tokenization
Tokenization is the process of splitting messages/ text into word or tokens. Some built-in components of tokenization in
Featurization
Featurization is the process of feature engineering in the user messages. In RASA, we can either used pre-trained word embeddings or supervised embeddings (e.g count vectors).
Entity Recognition/ Intent Classification/ Response Selectors
We can choose any components we want to perform any of these task. We can even combine all the components related to these tasks depending on what we want to build.

One of model that is often being mentioned in RASA NLU is spaCy. SpaCy is pre-trained word-embedding models in many different languages from GloVe word embedding or fastText word embedding.

We tried to customize pipeline with spaCy model (pre-trained embeddings) and try to compare the results with the supervised embeddings.

Comparison results of pretrained embeddings with spacy and supervised embeddings

For more explanation on how to configurate pipeline with existing components in RASA, you can check out the video below:

Update:

Here i tried to change the NLU configuration of the pipeline using other model, that is BERT.
Rasa provided the pipeline and configurations for BERT model using Hugging Face model in pipeline.

I’m using this Hugging Face configuration:

Hugging Face pipeline configurations

And the test result graphic is as shown below:

Angela Marpaung

A third-year student majoring in Computer Science. Currently studying at Telkom University, Indonesia.

I have interest in Analyzing Data, Machine Learning, and also Artificial Intelligence field.

Artificial Intelligence | Data Science | Natural Language Processing (NLP)

Maximizing KTP-OCR Performance using Regular Expression
ByFirhan Maulana Rusli June 27, 2020December 31, 2023

Regular expression (or RE) specifies a set of strings that matches it; the functions in this module let you check if a particular string matches a given regular expression (or if a given regular expression matches a particular string, which comes down to the same thing). we will maximize the performance of KTP-OCR using RE….

Read More Maximizing KTP-OCR Performance using Regular Expression
Artificial Intelligence | Computer Vision | Data Science

Getting Coordinate and Cropping an Image with OpenCV
ByFirhan Maulana Rusli June 6, 2020December 31, 2023

OpenCV is popular library for computer vision. OpenCV is an open source computer vision and machine learning software library. OpenCV was built to provide a common infrastructure for computer vision applications and to accelerate the use of machine perception in the commercial products. OpenCV uses machine learning to detect object/faces in picture. for detecting face,…

Read More Getting Coordinate and Cropping an Image with OpenCV
Artificial Intelligence | Chatbot | Natural Language Processing (NLP) | ngrok | RASA

Integrating RASA chatbot assistant to Facebook Messenger
ByAngela Marpaung June 15, 2020December 31, 2023

Here is tutorial on how we can integrate RASA chatbot assistant to Facebook Messenger. Here we’ll be using ngrok to expose a local server on the internet, so make sure you have installed ngrok before.If you haven’t installed ngrok, you can install it here. Make sure you have created an account in facebook for developers…

Read More Integrating RASA chatbot assistant to Facebook Messenger
Artificial Intelligence | Computer Vision | Data Science | Natural Language Processing (NLP)

Confidence in KTP-OCR using Pytesseract
ByFirhan Maulana Rusli July 18, 2020December 31, 2023

In previous blog, we already learn how to crop an image http://about.lovia.id/getting-cordinate-and-cropping-an-image-with-opencv/. Then we will learn how to got confidence using pytesseract, After much searching, there was some some ways to got confidence in my KTP-OCR. Pytesseract give us a lot of syntax that can we use, such as : #this line of code will…

Read More Confidence in KTP-OCR using Pytesseract
Artificial Intelligence | Chatbot | Natural Language Processing (NLP) | RASA

Build Contextual AI Assistant (Chatbot)
ByAngela Marpaung May 22, 2020December 31, 2023

We have discussed about chatbot and why should we build a chatbot on the previous post http://about.lovia.id/build-your-first-rasa-chatbot/. On the previous post, we build a simple chatbot that asks a user’s feeling and tries to cheer the user up when they’re sad.This time we are going to set our game up and build a contextual chatbot…

Read More Build Contextual AI Assistant (Chatbot)
Artificial Intelligence | Computer Vision

KTP-OCR in Python using Pytesseract
ByFirhan Maulana Rusli May 18, 2020December 31, 2023

KTP-OCR is an open source python package that attempts to create a production grade KTP extractor. The aim of the package is to extract as much information as possible yet retain the integrity of the information. For example, we will upload the photo first like this: And after u upload the photo the system will…

Read More KTP-OCR in Python using Pytesseract

Similar Posts