The GPT models from OpenAI and Google’s BERT make use of the transformer architecture, likewise. These models also use a system named “Consideration,” by which the model can discover which inputs are entitled to much more consideration than others in specific conditions.To be sure a good comparison and isolate the influence on the finetuning