Description

Our goal was to create a Proof of Concept (PoC) solution for matching messages from Telegram marketplaces.

We developed two models, RoSBERTa-hermes-ru and rubert-tiny-separater, described below.

Architecture and Pretraining

RoSBERTa-hermes-ru

RoSBERTa-hermes-ru is based on ai-forever/ru-en-RoSBERTa, with multiple heads for downstream tasks:
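The individual heads are not listed here. As a rough illustration of the general pattern (one shared encoder, several task heads), here is a minimal PyTorch sketch. Everything in it is an assumption made for illustration: the class names, the head names `task_a` and `task_b`, their output sizes, and the choice of first-token pooling. `TinyEncoder` is only a stand-in so the example runs offline.

```python
from types import SimpleNamespace

import torch
from torch import nn


class TinyEncoder(nn.Module):
    """Stand-in for a transformer encoder (hypothetical, for offline runs)."""

    def __init__(self, vocab_size: int = 100, hidden_size: int = 16):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden_size)

    def forward(self, input_ids, attention_mask=None):
        # Mimic the Hugging Face output object with a last_hidden_state field.
        return SimpleNamespace(last_hidden_state=self.embed(input_ids))


class MultiHeadModel(nn.Module):
    """Shared encoder with one linear head per downstream task."""

    def __init__(self, encoder: nn.Module, hidden_size: int, head_sizes: dict):
        super().__init__()
        self.encoder = encoder
        self.heads = nn.ModuleDict(
            {name: nn.Linear(hidden_size, n_out) for name, n_out in head_sizes.items()}
        )

    def forward(self, input_ids, attention_mask):
        hidden = self.encoder(
            input_ids=input_ids, attention_mask=attention_mask
        ).last_hidden_state
        pooled = hidden[:, 0]  # first-token pooling (assumption)
        # Every head reads the same pooled representation.
        return {name: head(pooled) for name, head in self.heads.items()}


# Hypothetical head names and sizes, chosen only for the demo.
model = MultiHeadModel(TinyEncoder(), hidden_size=16, head_sizes={"task_a": 3, "task_b": 1})
input_ids = torch.randint(0, 100, (2, 5))
attention_mask = torch.ones(2, 5, dtype=torch.long)
outputs = model(input_ids, attention_mask)
```

To try this with the real base model, `TinyEncoder()` could be swapped for `transformers.AutoModel.from_pretrained("ai-forever/ru-en-RoSBERTa")`, taking `hidden_size` from the loaded model's `config.hidden_size`. The released checkpoint may be organized differently.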

rubert-tiny-separater

rubert-tiny-separater is based on sergeyzh/rubert-tiny-turbo with a linear layer on top. The whole model, encoder and linear layer together, was trained to classify message types from Telegram marketplaces.
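The sketch below shows one way an encoder with a linear layer on top can be trained end to end for classification. It is not the released training code. The mean pooling, the number of labels (3), the learning rate, and the stand-in `TinyEncoder` are all assumptions chosen so the example runs without downloading anything.

```python
from types import SimpleNamespace

import torch
from torch import nn


class TinyEncoder(nn.Module):
    """Stand-in for a transformer encoder (hypothetical, for offline runs)."""

    def __init__(self, vocab_size: int = 100, hidden_size: int = 16):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden_size)

    def forward(self, input_ids, attention_mask=None):
        return SimpleNamespace(last_hidden_state=self.embed(input_ids))


class MessageTypeClassifier(nn.Module):
    """Encoder followed by a single linear classification layer."""

    def __init__(self, encoder: nn.Module, hidden_size: int, num_labels: int):
        super().__init__()
        self.encoder = encoder
        self.classifier = nn.Linear(hidden_size, num_labels)

    def forward(self, input_ids, attention_mask):
        hidden = self.encoder(
            input_ids=input_ids, attention_mask=attention_mask
        ).last_hidden_state
        # Mean pooling over non-padding tokens (assumption).
        mask = attention_mask.unsqueeze(-1).to(hidden.dtype)
        pooled = (hidden * mask).sum(dim=1) / mask.sum(dim=1).clamp(min=1.0)
        return self.classifier(pooled)


model = MessageTypeClassifier(TinyEncoder(), hidden_size=16, num_labels=3)
# The optimizer receives all parameters, so encoder and head are both updated.
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)

input_ids = torch.randint(0, 100, (4, 6))
attention_mask = torch.ones(4, 6, dtype=torch.long)
labels = torch.tensor([0, 2, 1, 0])  # hypothetical label ids

logits = model(input_ids, attention_mask)
loss = nn.functional.cross_entropy(logits, labels)
loss.backward()
optimizer.step()
```

Because nothing is frozen, gradients reach the encoder weights as well as the linear layer, which is what "training the whole model" means here.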

Labels:

Supported Languages

Russian, with English also supported.