SaluteSpeech (Sber) Integration for Speech Recognition

Complexity: simple
Timeline: from 1 to 3 days

SaluteSpeech is Sberbank's speech technology with a focus on the Russian language. It runs on Russian infrastructure, is GOST-compliant, and can be deployed on-premise, which makes it particularly suitable for the banking, financial, and government sectors.

### Technical Specifications

- WER on spoken Russian: 10–14%
- Streaming recognition latency: 200–400 ms
- Support for 8 kHz and 16 kHz audio
- Diarization for up to 10 speakers

### REST API Integration

```python
import base64
import uuid

import requests

# Project credentials (placeholders, substitute your own values)
CLIENT_ID = "<client-id>"
CLIENT_SECRET = "<client-secret>"


# Get an access token
def get_token() -> str:
    credentials = base64.b64encode(f"{CLIENT_ID}:{CLIENT_SECRET}".encode()).decode()
    response = requests.post(
        "https://ngw.devices.sberbank.ru:9443/api/v2/oauth",
        headers={
            "Authorization": f"Basic {credentials}",
            "RqUID": str(uuid.uuid4()),  # unique identifier for each request
            "Content-Type": "application/x-www-form-urlencoded",
        },
        data={"scope": "SALUTE_SPEECH_PERS"},
    )
    response.raise_for_status()  # fail early on an auth or network error
    return response.json()["access_token"]


# Transcription
def transcribe(audio_bytes: bytes, token: str) -> str:
    response = requests.post(
        "https://smartspeech.sber.ru/rest/v1/speech:recognize",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "audio/x-pcm;bit=16;rate=16000",
        },
        data=audio_bytes,
    )
    response.raise_for_status()
    return response.json()["result"][0]["normalized_text"]
```
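
The `Content-Type` header in `transcribe` declares raw 16-bit PCM at 16 kHz, so the audio bytes need to match it. A minimal sketch of preparing WAV input is shown below, using only the standard library. The helper name `wav_to_pcm` is hypothetical, and the mono check is an assumption of this sketch, not a documented requirement of the service.

```python
import io
import wave

# Sample rates named in the technical specifications (8 kHz and 16 kHz).
SUPPORTED_RATES = (8000, 16000)


def wav_to_pcm(wav_bytes: bytes) -> tuple[bytes, str]:
    """Return raw PCM frames and a matching Content-Type value.

    Hypothetical helper: it strips the WAV container and checks the
    audio format before the bytes are sent for recognition.
    """
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav:
        rate = wav.getframerate()
        if wav.getsampwidth() != 2:
            raise ValueError("expected 16-bit samples")
        if wav.getnchannels() != 1:
            # Assumption of this sketch: single-channel input only.
            raise ValueError("expected mono audio")
        if rate not in SUPPORTED_RATES:
            raise ValueError(f"unsupported sample rate: {rate} Hz")
        pcm = wav.readframes(wav.getnframes())
    # Same header format as the one used in transcribe().
    return pcm, f"audio/x-pcm;bit=16;rate={rate}"
```

For 16 kHz input the returned header value equals the one hard-coded in `transcribe`. For 8 kHz telephony recordings the returned value would have to be sent in its place.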