Question 1

Ядро технологий: На какой платформе построен бот?

Accepted Answer

Собственная разработка - оркестратор. Модульная архитектура позволяет подключать разные провайдеры ASR, TTS, LLM и интегрироваться с внешними системами.

Question 2

Чьи технологии используются для распознавания речи (Speech-to-Text)?

Accepted Answer

Поддерживаются основные провайдеры AI для распознавания речи, а также собственные inference-решения. Система устойчиво обрабатывает акценты, паузы, спонтанную речь и самокоррекции.

Question 3

Как бот понимает контекст диалога?

Accepted Answer

Да. Используется сценарный промпт, контекстная память диалога и логика ветвления сценариев. Рекомендуем формировать промпты на основе записей реальных звонков для учета всех бизнес сценариев.

Question 4

С какими системами легко интегрируется?

Accepted Answer

Поддерживаются интеграции с CRM-системами, телефонией, 1С, базами знаний, мессенджерами. Возможна разработка кастомных коннекторов под требования заказчика.

Question 5

Модель оплаты: Какова модель ценообразования?

Accepted Answer

Поминутная тарификация с пакетами от 100 до 100 000 минут. Стоимость минуты от 8 до 25 рублей. Тарификация посекундная.

Aiva

Frequently asked questions

1.Core stack: which platform powers the bot?

2.Whose technology powers speech-to-text? Does it support different accents, dialects, spontaneous speech with pauses and corrections?

3.Whose technology powers text-to-speech? What voices are available (male, female, neutral)? Can timbre, speed and intonation be tuned?

4.How does the bot understand the dialog context? Can it sustain a multi-turn conversation with clarifications?

5.How does it handle typos, slang and complex phrasing?

6.Are there built-in scenarios (intents) for an industry (e.g. doctor's bookings, table reservations, support)?

7.Which systems does it integrate with (CRM — AmoCRM, Bitrix24; telephony; 1C; knowledge bases; messengers)?

8.Where and how is conversation data stored? Do you ensure compliance with 152-FZ (Russia) or GDPR?

9.Latency: what is the average response time from the end of the customer's utterance to the bot's reply (in milliseconds)?

10.Throughput: how many simultaneous conversations can the system handle? How does it scale during traffic peaks (e.g. an advertising campaign)?

11.Voice sources: are stock provider voices used or custom-built?

12.Quality and naturalness: can we hear live samples (demo recordings)? How emotional and natural is the speech (neural TTS)?

13.Can the voice be tuned to our brand (gender, age, character)?

14.Can a synthesized voice be created for a key employee or brand persona?

15.Can speech speed, key-word emphasis and pauses be adjusted?

16.Audio production: do you provide recording of welcome messages, jingles, background music?

17.Pricing model: how is the service priced (monthly/yearly subscription, per-minute, per successful dialog)?

18.Implementation timeline: how long does setup and launch take for a typical / non-typical project?

19.Who configures the bot — us via the builder or your team?

20.Bot training: how does initial training happen? How easy is it to add new questions and answers after launch?

21.Analytics and reports: what data does the system provide (recordings, transcripts, dialog map, metrics: resolution rate, hand-off reason, sentiment)?

22.Key advantage: what is your main differentiator from other market solutions (technology or outcome, not price)?

Still have questions?