Transformer Model Applications

A look under the hood of transfomers, the engine driving AI model evolution

Today, virtually every cutting-edge AI product and model uses a transformer architecture. Large language models (LLMs) such as GPT-4o, LLaMA, Gemini and Claude are all transformer-based, and other AI ...

SiliconANGLE

AI21 Labs’ updated hybrid SSM-Transformer model Jamba gets longest context window yet

OpenAI rival AI21 Labs Ltd. today lifted the lid off of its latest competitor to ChatGPT, unveiling the open-source large language models Jamba 1.5 Mini and Jamba 1.5 Large. The new models are based ...

Semiconductor Engineering

AI Transformer Models Enable Machine Vision Object Detection

The object detection required for machine vision applications such as autonomous driving, smart manufacturing, and surveillance applications depends on AI modeling. The goal now is to improve the ...

CU Boulder News & Events

Building a Vision Transformer Model From Scratch

The self-attention-based transformer model was first introduced by Vaswani et al. in their paper Attention Is All You Need in 2017 and has been widely used in natural language processing. A ...

Search Engine Land

Transformer architecture: An SEO’s guide

As we encounter advanced technologies like ChatGPT and BERT daily, it’s intriguing to delve into the core technology driving them – transformers. This article aims to simplify transformers, explaining ...

Geeky Gadgets

Etched Sohu super fast AI chip designed specifically for Transformer models

The Sohu AI chip, developed by the startup Etched, is making waves in the world of artificial intelligence. Hailed as the fastest AI chip ever created, Sohu promises to transform AI hardware with its ...

VentureBeat

Microsoft trains world's largest Transformer language model

Microsoft AI & Research today shared what it calls the largest Transformer-based language generation model ever and open-sourced a deep learning library named DeepSpeed to make distributed training of ...

EurekAlert!

Path Planning Transformers supervised by IRRT*-RRMS for multi-mobile robots

In a study published in Robot Learning journal, researchers propose a new learning-based path planning framework that allows mobile robots to navigate safely and efficiently using a Transformer model.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results