BERT Tutorials and Examples

Multilingual vs. Monolingual BERT

Devlin et al. (2019) released two monolingual BERT models, one for English and one for Chinese. To support other languages, they trained a multilingual BERT (mBERT).

BERT Benchmarks in Large Pre-trained Language Models

An integral part of developing pre-trained language models (PLMs) is providing multitask natural language understanding (NLU) benchmarks, which are used to demonstrate the linguistic abilities of new models and approaches.

An Introduction to Masking BERT Inputs for Text Classification

In NLP, we usually do not mask any input embeddings when using BERT for a text classification task. However, the paper "Spelling Error Correction with Soft-Masked BERT" proposes a method that soft-masks the input embeddings.
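
To make the idea concrete, here is a minimal sketch of soft-masking: each token embedding is blended with the [MASK] embedding, weighted by the probability that the token is an error. The function name soft_mask and the toy 2-dimensional vectors are our own illustrations; in the paper the probabilities come from a separate detection network, which is not modeled here.

```python
def soft_mask(token_embeddings, error_probs, mask_embedding):
    """Blend each token embedding with the [MASK] embedding by its error probability."""
    if len(token_embeddings) != len(error_probs):
        raise ValueError("need one error probability per token")
    blended = []
    for emb, p in zip(token_embeddings, error_probs):
        # p = 0 keeps the token embedding; p = 1 replaces it with the [MASK] embedding.
        blended.append([p * m + (1.0 - p) * e for e, m in zip(emb, mask_embedding)])
    return blended


# Toy 2-dimensional embeddings (made-up values, for illustration only).
tokens = [[1.0, 0.0], [0.0, 1.0]]
mask = [0.5, 0.5]
probs = [0.0, 1.0]  # hypothetical detector output: second token is surely an error
soft = soft_mask(tokens, probs, mask)
```

With these toy values the first embedding is left unchanged and the second one becomes the [MASK] embedding; probabilities in between give a weighted mix of the two.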

How to incorporate Chinese phonic embedding and shape embedding into BERT inputs?

Incorporating Extra Features for BERT Inputs in NLP

In this article, we introduce a method for incorporating extra features into BERT inputs to improve performance on NLP tasks.
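
One simple way to do this, sketched below, is to add the extra feature embeddings to the token embeddings element-wise, in the same way BERT already sums its token, segment, and position embeddings. Summing is our assumption here, not necessarily the method used in the article (concatenation followed by a projection is another option), and the function name and toy vectors are hypothetical.

```python
def combine_input_embeddings(token_emb, phonic_emb, shape_emb):
    """Sum three per-token embedding sequences element-wise."""
    if not (len(token_emb) == len(phonic_emb) == len(shape_emb)):
        raise ValueError("all embedding sequences must have one vector per token")
    return [
        [t + p + s for t, p, s in zip(tok, pho, shp)]
        for tok, pho, shp in zip(token_emb, phonic_emb, shape_emb)
    ]


# One token with made-up 2-dimensional embeddings (illustration only).
combined = combine_input_embeddings(
    token_emb=[[1.0, 2.0]],
    phonic_emb=[[0.5, 0.5]],
    shape_emb=[[0.25, 0.25]],
)
```

The combined sequence keeps the same length and dimension as the token embeddings, so it can be passed on in their place.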

Evaluating Sentence Similarity Using BERTScore

In this article, we introduce a new metric for sentence similarity, BERTScore, which performs better than cosine similarity.
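
The core of BERTScore is greedy matching between token embeddings: every candidate token is matched to its most similar reference token (precision), and every reference token to its most similar candidate token (recall). The sketch below shows only this matching step on made-up vectors; it assumes non-zero embeddings, leaves out the optional IDF weighting and baseline rescaling, and does not call a real BERT model.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def bert_score(candidate, reference):
    """Greedy-matching precision, recall and F1 over token embeddings."""
    # Precision: each candidate token is matched to its most similar reference token.
    precision = sum(max(cosine(c, r) for r in reference) for c in candidate) / len(candidate)
    # Recall: each reference token is matched to its most similar candidate token.
    recall = sum(max(cosine(r, c) for c in candidate) for r in reference) / len(reference)
    total = precision + recall
    f1 = 2 * precision * recall / total if total > 0 else 0.0
    return precision, recall, f1


# Toy token embeddings (illustration only): the candidate covers half of the reference.
reference = [[1.0, 0.0], [0.0, 1.0]]
candidate = [[1.0, 0.0]]
p, r, f = bert_score(candidate, reference)
```

In this toy case the single candidate token has an exact match, so precision is high, while one reference token is left without a good match, which lowers recall.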

An Introduction to Fine-tuning BERT to Create Specific Vectors in Few-Shot Learning

In this article, we introduce how to generate specific vectors by fine-tuning BERT in few-shot learning.

Concatenate Target and Text for Text Classification in BERT

We can feed a text sequence to a BERT model for classification. However, if the text sequence has a target text or object, how should we use BERT?
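
A common approach is to feed the target and the text as a sentence pair, separated by [SEP] and distinguished by segment ids. The sketch below builds such an input by hand. The function name is hypothetical, whitespace splitting stands in for a real WordPiece tokenizer, and putting the target first is only one possible ordering.

```python
def build_pair_input(target, text):
    """Build tokens and segment ids for the pair '[CLS] target [SEP] text [SEP]'."""
    # Whitespace splitting stands in for a real WordPiece tokenizer (assumption).
    target_tokens = target.lower().split()
    text_tokens = text.lower().split()
    tokens = ["[CLS]"] + target_tokens + ["[SEP]"] + text_tokens + ["[SEP]"]
    # Segment 0 covers [CLS], the target and the first [SEP]; segment 1 covers the rest.
    segment_ids = [0] * (len(target_tokens) + 2) + [1] * (len(text_tokens) + 1)
    return tokens, segment_ids


tokens, segment_ids = build_pair_input("battery life", "The battery life is great")
```

The classifier can then read the [CLS] position as usual, while self-attention lets every text token attend to the target tokens.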

Do BERT or ELMo Representations Encode Syntax Information?

In this article, we find that representations generated by BERT or ELMo can encode syntactic information.

Beginner's Guide to Using BERT for Multi-task Learning

This tutorial discusses how to use a BERT model for multi-task learning. You can build your own custom model based on this post.

How to combine multiple features with BERT for document classification?

An Introduction to Combining Multiple Features with BERT for Classification

BERT is widely used in text classification. However, BERT can only extract text features from a text sequence. If you have other features, how can you combine them with the BERT output for classification?
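
A straightforward option is late fusion: concatenate the text vector produced by BERT with the other features, then pass the result through a linear layer and a softmax. The sketch below assumes this design. The text vector, extra feature, weights, and function name are made-up placeholders, and no real BERT model is called.

```python
import math


def classify_with_extra_features(text_vector, extra_features, weights, bias):
    """Concatenate a text vector with extra features and apply a linear softmax classifier."""
    fused = list(text_vector) + list(extra_features)
    if any(len(row) != len(fused) for row in weights):
        raise ValueError("each weight row must match the fused feature size")
    logits = [sum(w * x for w, x in zip(row, fused)) + b for row, b in zip(weights, bias)]
    # Numerically stable softmax over the class logits.
    top = max(logits)
    exps = [math.exp(z - top) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical 2-dimensional text vector plus one extra numeric feature, two classes.
probs = classify_with_extra_features(
    text_vector=[0.2, -0.1],
    extra_features=[1.0],
    weights=[[1.0, 0.0, 0.5], [0.0, 1.0, -0.5]],
    bias=[0.0, 0.0],
)
```

In practice the extra features are usually normalized first so that they are on a scale similar to the BERT output, and the linear layer is trained together with the rest of the model.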