Parametrization strategy can make the optimization more stable and improve the efficiency when file-tuning a model.
Bert is widely used in text classification, However, Bert only can extract text feature from a text sequence. If you have other features, how to combine them with Bert output to implement classification.