ChatGPT’s Twitter bio

GPT

GPT stands for Generative Pre-trained Transformer, a type of large language model developed by OpenAI. It is designed to understand and generate human-like text based on the input it receives. GPT is built on a neural network architecture called the Transformer, which lets it process language in a way that captures context and the relationships between words across long text sequences.


TRANSFORMERS

In 2017, the Transformer architecture was introduced in a paper titled "Attention Is All You Need."

This was a revolutionary paper that went in depth into the problems with pre-existing models.

One of its key observations, about the recurrent models that came before it, was:

"This inherently sequential nature precludes parallelization within training examples, which becomes critical at longer sequence lengths, as memory constraints limit batching across examples."
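To make the quote concrete, here is a minimal sketch in plain Python that contrasts the two styles of computation. It is a toy illustration, not the paper's implementation: the function names (`recurrent_states`, `attend`, `self_attention`), the 0.5 decay factor, and the tiny example vectors are all made up for this post, and the attention step skips the learned query, key, and value projections that a real Transformer uses. The point is only that the recurrent loop needs the previous step's result, while each attention output can be computed on its own.

```python
import math


def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def recurrent_states(inputs):
    """Toy recurrence: every state depends on the previous state,
    so the steps have to run one after another."""
    state = [0.0] * len(inputs[0])
    states = []
    for x in inputs:
        # 0.5 is an arbitrary illustrative decay, not a trained weight
        state = [math.tanh(0.5 * s + xi) for s, xi in zip(state, x)]
        states.append(state)
    return states


def attend(inputs, i):
    """Toy scaled dot-product attention for position i.
    Queries, keys, and values are all just the raw inputs here."""
    query = inputs[i]
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in inputs]
    weights = softmax(scores)
    return [
        sum(w * value[j] for w, value in zip(weights, inputs))
        for j in range(len(query))
    ]


def self_attention(inputs):
    """Each position only reads the inputs, never another position's
    output, so these calls could run in any order or all at once."""
    return [attend(inputs, i) for i in range(len(inputs))]


tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # three made-up 2-d "word" vectors
print(recurrent_states(tokens))
print(self_attention(tokens))
```

Because `attend` never looks at another position's output, real implementations can compute every position at once as a few large matrix multiplications, which is the kind of parallelization the quote says recurrent models rule out.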

A tweet by Andrej Karpathy, former Director of AI at Tesla.
