In this article, we will introduce how to compute self-attention with relative position representations in deep learning.
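As a minimal single-head sketch of the idea from Shaw et al.'s Self-Attention with Relative Position Representations (the clipping distance `max_dist` and the embedding table `rel_emb` are illustrative assumptions, not the paper's exact setup):

```python
import torch
import torch.nn.functional as F

def relative_attention(q, k, v, rel_emb, max_dist):
    # q, k, v: (seq_len, d); rel_emb: (2 * max_dist + 1, d)
    seq_len, d = q.shape
    pos = torch.arange(seq_len)
    # relative distance j - i, clipped to [-max_dist, max_dist],
    # then shifted to [0, 2 * max_dist] to index the embedding table
    rel = (pos[None, :] - pos[:, None]).clamp(-max_dist, max_dist) + max_dist
    a = rel_emb[rel]                                   # (seq_len, seq_len, d)
    # content-content term plus content-position term
    scores = q @ k.t() + torch.einsum('id,ijd->ij', q, a)
    weights = F.softmax(scores / d ** 0.5, dim=-1)
    return weights @ v

q = torch.randn(5, 16); k = torch.randn(5, 16); v = torch.randn(5, 16)
rel_emb = torch.randn(2 * 4 + 1, 16)
out = relative_attention(q, k, v, rel_emb, max_dist=4)  # (5, 16)
```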
In this article, we will introduce how to generate a specific vector by fine-tuning BERT in few-shot learning.
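As an illustration of the vector-extraction step only (the few-shot fine-tuning loop itself is omitted), one common way to get a sentence vector from a Hugging Face BERT model is to take the [CLS] hidden state:

```python
import torch
from transformers import BertTokenizer, BertModel

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased")

inputs = tokenizer("a few-shot example sentence", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)
# use the [CLS] token's final hidden state as the sentence vector
cls_vector = outputs.last_hidden_state[:, 0, :]   # shape: (1, 768)
```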
A parametrization strategy can make optimization more stable and improve efficiency when fine-tuning a model.
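As one concrete example of such a strategy, weight normalization reparametrizes a layer's weight as a magnitude times a direction, and PyTorch ships it as a built-in utility (a sketch assuming PyTorch):

```python
import torch.nn as nn

layer = nn.Linear(16, 8)
# reparametrize weight as w = g * v / ||v||, so the magnitude g and
# the direction v are optimized as separate parameters
layer = nn.utils.weight_norm(layer)
print(layer.weight_g.shape, layer.weight_v.shape)
```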
We can input a text sequence into a BERT model for classification. However, if this text sequence comes with a target text or object, how can we use BERT?
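One common approach (a sketch, not the only option) is to feed the target as the second segment of a BERT sentence pair; the sentence and target below are made up for illustration:

```python
from transformers import BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
# encode the sentence and the target as a sentence pair:
# [CLS] sentence [SEP] target [SEP]
inputs = tokenizer("the battery life is great", "battery",
                   return_tensors="pt")
# token_type_ids distinguish the sentence (0) from the target (1)
print(inputs["input_ids"])
print(inputs["token_type_ids"])
```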
If you only have a scalar (a single number), how can you convert it into a vector for matrix operations? The paper Attention Is All You Need gives a method: sinusoidal encoding.
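A minimal NumPy sketch of that sinusoidal encoding; it concatenates the sin and cos halves rather than interleaving them, which is a common equivalent variant, and `d_model=8` is just for illustration:

```python
import numpy as np

def sinusoidal_encoding(x, d_model=8):
    # map a scalar x to a d_model-dimensional vector using the
    # sin/cos frequency scheme from Attention Is All You Need
    i = np.arange(d_model // 2)
    freqs = 1.0 / (10000 ** (2 * i / d_model))
    angles = x * freqs
    return np.concatenate([np.sin(angles), np.cos(angles)])

print(sinusoidal_encoding(3.0))  # an 8-dimensional vector
```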
In this article, we will see that representations generated by BERT or ELMo can encode syntax information.
The squash function is a non-linear function that shrinks short vectors to a length of almost zero and scales long vectors to a length slightly below 1.
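A short PyTorch sketch using the common formulation from capsule networks, squash(v) = ||v||^2 / (1 + ||v||^2) * v / ||v||:

```python
import torch

def squash(v, dim=-1, eps=1e-8):
    # ||v||^2 / (1 + ||v||^2) * v / ||v||
    sq_norm = (v ** 2).sum(dim=dim, keepdim=True)
    scale = sq_norm / (1.0 + sq_norm)
    return scale * v / torch.sqrt(sq_norm + eps)

short = torch.tensor([0.01, 0.0])
long = torch.tensor([100.0, 0.0])
print(squash(short).norm())  # close to 0
print(squash(long).norm())   # slightly below 1
```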
Topic attention is proposed in the paper Aspect Category Detection via Topic-Attention Network; it can incorporate topic information into the attention mechanism.
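As a schematic sketch of the general idea, a topic vector can act as the attention query over token states; this is only an illustration and not the paper's exact formulation:

```python
import torch
import torch.nn.functional as F

def topic_attention(hidden, topic):
    # hidden: (seq_len, d) token states; topic: (d,) topic vector
    # score each token against the topic, then pool the tokens
    scores = hidden @ topic             # (seq_len,)
    weights = F.softmax(scores, dim=0)
    return weights @ hidden             # (d,) topic-aware sentence vector

hidden = torch.randn(6, 32)
topic = torch.randn(32)
sent_vec = topic_attention(hidden, topic)  # (32,)
```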
Generally, a dependency tree is not target-oriented. We can build a dependency tree for a sentence with some Python libraries.
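For example, with spaCy (assuming the en_core_web_sm model is installed):

```python
import spacy

nlp = spacy.load("en_core_web_sm")
doc = nlp("The battery life of this laptop is great.")
for token in doc:
    # each token points to its syntactic head with a dependency label
    print(f"{token.text:10s} --{token.dep_:8s}--> {token.head.text}")
```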
Relational Graph Attention Network (RGAT) is an extension of the original Graph Attention Network (GAT), which was proposed in the paper Graph Attention Networks.
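As a schematic sketch of the core idea, run one attention computation per relation type and sum the results; this is an illustration under those assumptions, not the paper's exact formulation:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SimpleRGATLayer(nn.Module):
    # one GAT-style attention head per relation, summed over relations
    def __init__(self, in_dim, out_dim, num_relations):
        super().__init__()
        self.w = nn.Parameter(torch.randn(num_relations, in_dim, out_dim))
        self.a = nn.Parameter(torch.randn(num_relations, 2 * out_dim))

    def forward(self, h, adj):
        # h: (n, in_dim); adj: (num_relations, n, n), 1 where an edge exists
        out = 0
        for r in range(adj.size(0)):
            hr = h @ self.w[r]                              # (n, out_dim)
            n = hr.size(0)
            # pairwise features [h_i ; h_j] for the attention score
            pair = torch.cat([hr.unsqueeze(1).expand(n, n, -1),
                              hr.unsqueeze(0).expand(n, n, -1)], dim=-1)
            e = F.leaky_relu(pair @ self.a[r])              # (n, n)
            e = e.masked_fill(adj[r] == 0, float('-inf'))
            alpha = F.softmax(e, dim=-1)
            alpha = torch.nan_to_num(alpha)  # rows with no edges in relation r
            out = out + alpha @ hr
        return out
```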