What are Transformers (Machine Learning Model)?

What are Transformers (Machine Learning Model)?

Introduction to Transformers

In this section, the speaker introduces transformers and their capabilities.

What are Transformers?

  • Transformers are generative pre-trained transformer models that can produce text that looks like it was written by a human.
  • They can write poetry, craft emails, and even come up with their own jokes.

How do Transformers Work?

  • Transformers consist of two parts: an encoder and a decoder.
  • The encoder works on the input sequence while the decoder operates on the target output sequence.
  • The transformer takes a sequence of tokens (words in a sentence) and predicts the next word in the output sequence through iterating through encoder layers.
  • The attention mechanism provides context around items in the input sequence, which gives transformers an advantage over algorithms like RNNs that must run in sequence.

Applications of Transformers

  • Language translation is one example where transformers can be applied to convert one language into another.
  • Document summaries are another great example where you can feed in a whole article as input and generate an output summary of just a couple of sentences that summarize the main points.
  • Beyond just language, transformers have done things like learn to play chess and perform image processing that even rivals the capabilities of convolutional neural networks.
Video description

Learn more about Transformers → http://ibm.biz/ML-Transformers Learn more about AI → http://ibm.biz/more-about-ai Check out IBM Watson → http://ibm.biz/more-about-watson Transformers? In this case, we're talking about a machine learning model, and in this video Martin Keen explains what transformers are, what they're good for, and maybe ... what they're not so good at for. Download a free AI ebook → http://ibm.biz/ai-ebook-free Read about the Journey to AI → http://ibm.biz/ai-journey-blog Get started for free on IBM Cloud → http://ibm.biz/Bdf7QA Subscribe to see more videos like this in the future → http://ibm.biz/subscribe-now #AI #Software #ITModernization

What are Transformers (Machine Learning Model)? | YouTube Video Summary | Video Highlight