Since its conception 5 years ago, transformers have increasingly found applications outside of NLP - computer vision, reinforcement learning, and GANs. Here are a couple of resources that could get you started understanding the technology enabling GPT-3, AlphaFold2, etc.
➡️ https://web.stanford.edu/class/cs25/ - An exciting inventory of lecture and discussion videos from @StanfordEng that talks about the state-of-the-art transformer architectures and applications outside NLP
➡️ https://web.stanford.edu/class/cs224n/ - This course by @stanfordnlp has a lecture that introduces the attention mechanism and transformers in the context of NLP
➡️ https://huggingface.co/docs/transformers/index - A repository of transformer-based models that you can use online or download for further exploration provided by @huggingface
Progress in ML is accelerating more than ever - let me know of other resources that provide alternative perspectives on transformer architectures!