What Can Go Wrong When Fine-tuning BERT?
2 min readFeb 27
--
There are a lot of explanations elsewhere, here I’d like to share some example questions in an interview setting.
When fine-tuning BERT (Bidirectional Encoder Representations from Transformers) for your use case, what can go wrong? Or what should you pay attention to?