Problem
SemEval-2020 Task 12 was an AI competition whose goal was to identify offensive language in social media.
The leaderboard of the competition is presented in the following table.
Approach
We relied solely on the teacher-student paradigm, in which knowledge is transferred from a large model (the Teacher) to a small model (the Student).
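To make the teacher-student idea concrete, here is a minimal sketch of one common way to train a Student against a Teacher: a loss that blends matching the Teacher's softened predictions with fitting the true label. This illustrates the general technique and is not necessarily the setup we used in the competition. The function names, the `temperature` and `alpha` parameters, and the two-class logits are all assumptions made for the example.

```python
import math

def softmax(logits, temperature=1.0):
    """Convert raw scores to probabilities, softened by the temperature."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, label,
                      temperature=2.0, alpha=0.5):
    """Blend a soft-target term (match the Teacher) with a hard-label term.

    temperature and alpha are illustrative defaults, not tuned values.
    """
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    # KL(teacher || student) on the temperature-softened distributions
    soft = sum(t * math.log(t / s)
               for t, s in zip(p_teacher, p_student) if t > 0)
    # Standard cross-entropy of the Student against the ground-truth label
    hard = -math.log(softmax(student_logits)[label])
    return alpha * temperature ** 2 * soft + (1 - alpha) * hard

# Hypothetical logits for the classes [not offensive, offensive]
teacher = [2.0, -1.0]
agreeing_student = [2.0, -1.0]
disagreeing_student = [-1.0, 2.0]

loss_agree = distillation_loss(agreeing_student, teacher, label=0)
loss_disagree = distillation_loss(disagreeing_student, teacher, label=0)
```

A Student that agrees with the Teacher receives a lower loss than one that disagrees, which is what pushes the small model toward the large model's behavior during training.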
Result
The iSemantics team outperformed 81 teams with an F1-score of 93.19%. We achieved this result using a model 3x smaller than the one used by the UHH-LT team, which ranked 1st.
A model that is 3x smaller translates to roughly 3x faster inference and roughly 3x lower running costs when hosted on cloud services.
To learn more about automated data labeling techniques, read the full article here.