adporn.net Understanding the model architecture - Applied AI: Getting Started with Hugging Face Transformers Video Tutorial | LinkedIn Learning, formerly Lynda.com

LinkedIn and 3rd parties use essential and non-essential cookies to provide, secure, analyze and improve our Services, and to show you relevant ads (including professional and job ads) on and off LinkedIn. Learn more in our Cookie Policy.

Select Accept to consent or Reject to decline non-essential cookies for this use. You can update your choices at any time in your settings.

Start free trial Sign in

From the course: Applied AI: Getting Started with Hugging Face Transformers

Unlock the full course today

Join today to access over 24,400 courses taught by industry experts.

Understanding the model architecture

Understanding the model architecture

From the course: Applied AI: Getting Started with Hugging Face Transformers

Start my 1-month free trial Buy for my team

Understanding the model architecture

“

- [Instructor] While we simply use the pipeline to execute out of the box models, it is also important to understand the underlying model, it's architecture, and hyper parameters. If we just print the model attribute of the pipeline object, it'll print out a model architecture graph that helps us understand the structure behind the scenes. Let's explore the ner model this way. Let's execute the model command. This shows the model graph. The name shows as BERTForTokenClassification which mentions the architecture and its purpose. The embeddings branch show information about the word embeddings used. The embedding has 28,996 tokens in its vocabulary with each token having a vector size of 1024. Then comes position embeddings. There are 512 rows which points to the maximum sentence length supported. The vector length is the same as the word embedding, which is 1024. Normalization and dropout information are also shown.…

Contents

- (Locked)
  
  Continuing with Transformers
  
  45s