Sony Patents Next-Gen Chat System Enabling Real-Time Lip Sync Control in Games

The System Will Control Gameplay By Reading Your Silent Lip Movements!

Story Highlight
  • A new Sony patent proposes that players control games solely through lip movements.
  • Users can chat with others using silent lip movements, and the system automatically generates the speech.
  • The patent also discusses a real-time translation feature that translates speech into other players’ languages.

PlayStation and Xbox have seen great innovations over the last decade, but the voice chat feature has remained practically unchanged. A new Sony patent filing proposes a next-gen audio system that enables real-time game controls via silent lip movements.

We have found a new Sony patent that describes a next-gen chat-and-voice control system. It uses advanced AI and sensors to let players control single-player and multiplayer games with lip movements, without producing any sound.

So, a user can silently mouth voice commands to control gameplay and other mechanics, such as pausing the game and opening the menu. It is pitched as a great accessibility feature for gamers with stutters or tremors. But there’s much more to this patent.

A form of ‘silent’ voice control may be implemented based on the determined intended speech input of the first user. This provides a new form of input for users, and can further improve accessibility of the game by allowing speech impaired users to control the game without providing manual inputs.
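
As a rough illustration of the ‘silent’ voice control idea, the sketch below maps a phrase that a lip-reading model has already decoded to a game action. The patent text quoted here does not describe any API, so the `COMMANDS` table, the action names, and `dispatch_command` are all hypothetical.

```python
# Hypothetical sketch: none of these names come from the patent.
# It assumes a lip-reading model has already turned the user's
# mouth movements into a text phrase (the "intended speech input").
COMMANDS = {
    "pause": "PAUSE_GAME",
    "open menu": "OPEN_MENU",
}


def dispatch_command(intended_speech: str):
    """Return the game action for a decoded phrase, or None if it is not a command."""
    # Normalise case and collapse repeated whitespace before the lookup.
    phrase = " ".join(intended_speech.lower().split())
    return COMMANDS.get(phrase)


print(dispatch_command("Open  Menu"))
print(dispatch_command("nice shot"))
```

In this sketch, a phrase that is not a known command returns `None`, so a game could pass it on to chat instead of treating it as a control input.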

Why it matters: Sony could revolutionize gaming with this next-gen voice system, allowing players to control games and chat with others solely through lip movements. Those who stutter or are unable to speak loudly for any reason can greatly benefit.

The schematic flowchart shows the new communication protocol between two players in a game.

The patent, titled “AUDIO PROCESSING METHOD AND SYSTEM,” also describes letting players chat with others in multiplayer games using only lip movements. Sensors (camera, depth, infrared, electromyography, etc.) would capture lip, tongue, and facial movements, and machine learning would interpret the mouthing.

The system would generate natural, context-aware speech that can match the user’s real voice or a preferred alternative. For example, it may produce an urgent whisper during stealth gameplay when the player’s health is low.
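
The context-aware part can be pictured as a simple rule that picks a speaking style from the game state. This is only a sketch under stated assumptions: the `GameContext` fields, the `0.25` health threshold, and the style names are hypothetical, and the plain `whisper` case for stealth at normal health is our own guess rather than something the patent states.

```python
from dataclasses import dataclass


@dataclass
class GameContext:
    """Hypothetical snapshot of the game state (not from the patent)."""
    stealth: bool   # True while the player is in a stealth section
    health: float   # assumed range: 0.0 (empty) to 1.0 (full)


def pick_voice_style(ctx: GameContext, low_health: float = 0.25) -> str:
    """Choose a style label for the generated speech from the game context."""
    if ctx.stealth and ctx.health <= low_health:
        # The article's example: stealth gameplay with low health.
        return "urgent_whisper"
    if ctx.stealth:
        # Assumption: stealth alone still calls for a quieter voice.
        return "whisper"
    return "normal"


print(pick_voice_style(GameContext(stealth=True, health=0.1)))
```

A real system would presumably feed such a label, or a richer learned representation, into the speech generator along with the decoded words.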

Sony also discusses adding a real-time translation feature that converts your speech into the language of other players.

In addition, users may find communication over voice chat difficult due to speech issues, cognitive issues, not being able to speak fluently in the language being used to communicate, or the like.

The image shows the devices of the first and second users who are communicating using the new system.

Sony says speaking while gaming can be difficult, so a voice system driven by silent lip movements could be a useful accessibility and convenience feature. We may see the publisher bring the discussed features to its upcoming single-player and multiplayer titles.

Sony has published a motley collection of unique patents in the past, such as one discussing a method to simplify console game development without using devkits and another about the PS6 featuring an innovative dustproof design.

Do you think the Sony patent will open up PlayStation to a whole new audience of disabled users? Let us know your thoughts in the comments below, or join the discussion on the Tech4Gamers forum.
