Python PyQt Convert Speech Recognition CodeSource

LipGen: Viseme-Guided Lip Video Generation for Enhancing Visual Speech Recognition

Abstract: Visual speech recognition (VSR), commonly known as lip reading, has garnered significant attention due to its wide-ranging practical applications. The advent of deep learning techniques and ...

IEEE

Efficient Streaming LLM for Speech Recognition

Abstract: Recent works have shown that prompting large language models with audio encodings can unlock speech recognition capabilities. However, existing techniques do not scale efficiently, ...
