23
H2020 COMPRISE - Cost effective, Multilingual, Privacy-driven voice-enabled Services COMPRISE: Cost-effective, Multilingual, Privacy- driven voice-enabled Services Showcasing Language Technology in H2020 and INEA projects Zaineb Chelly − Project Manager Inria Joint work with 26/04/2019

Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

H2020 COMPRISE - Cost effective, Multilingual, Privacy-driven voice-enabled Services

COMPRISE: Cost-effective, Multilingual, Privacy-

driven voice-enabled Services

Showcasing Language Technology in

H2020 and INEA projects

Zaineb Chelly − Project ManagerInria

Joint work with

26/04/2019

Page 2: Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

1. Context

2. Issues

3. Objectives

4. Approach

Outline

2

Page 3: Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

Voice interaction today: operating branch

3

Page 4: Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs)

Hire humans to:

o Manually transcribe speech as a sequence of words

o Manually label dialog acts (question, statement, request, etc.) and slot values (e.g., for heating control, which heater, which temperature, when, and for how long)

Train speech-to-text, spoken language understanding, and dialog management systems

Repeat for every language

Today’s training approach

4

Page 5: Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

1. Context

2. Issues

3. Objectives

4. Approach

Outline

5

Page 6: Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

Issue 1: Privacy

6

User’s speech reveals information about:

user’s identity

traits (gender, ethnic origin, etc.) and states (health condition, etc.)

critical personal information (credit card, phone number, etc.)

user preferences (e.g., asking the system about another product)

sensitive business data (sales)

This results in:

risk of identity theft in case of security breach (cf. deepfakes)

unsolicited advertisement

business competition (cf. Amazon)

Page 7: Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

Issue 2: Cost

7

Transcribing the users’ speech data requires human annotators.

Programming dialog APIs requires application developers to knowthe language well enough, or to pay a human translator.

This incurs a huge cost.

Page 8: Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

Issue 3: Non-inclusiveness

8

Due to cost issues, commercial speech and dialog APIs are onlyavailable for few languages.

Users can only interact with systems developed for their ownlanguage.

Gap between certain users (e.g., people with non-native orregional accents) and others in terms of speech-to-textperformance.

Page 9: Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

1. Context

2. Issues

3. Objectives

4. Approach

Outline

9

Page 10: Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

Define a fully private-by-design methodology and tools that will reduce the cost and increase

the inclusiveness of voice interaction.

Overall objective

10

Page 11: Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

Detailed objectives

11

Objectives Solutions

Privacy-by-design Privacy-driven data transformation

InclusivenessMachine translation

+ user personalization

Cost-effectivenessWeakly supervised learning

+ cross-platform SDK

Sustainability Cloud-based platform

Real-world applicability Demonstrators

Page 12: Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

1. Context

2. Issues

3. Objectives

4. Approach

Outline

12

Page 13: Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

COMPRISE framework

13

Page 14: Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

COMPRISE app development

14

Developers set up an app including allCOMPRISE functionalities with onecommand, supported by COMPRISEvarious development tools.

Usage of COMPRISE features toimplement In-App-Logic.

Angular app will be compiled to beexecutable on multiple platforms.

Page 15: Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

COMPRISE cloud platform

15

Cloud platform provides access touser-independent models, analysismechanisms, model training andmanagement, etc.

Gives possibility to label und curatedata.

Serves domain-specific neutralmodels to COMPRISE applications.

Stores training data securely, whichis received from devices.

Page 16: Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

COMPRISE app + SDK

16

SDK is a library running onCOMPRISE Apps.

Receives neutral modelsfrom cloud and createspersonal models withpersonalized learning.

Personalized models beingused for Speech-to-text,Spoken LanguageUnderstanding, etc.

Training data sent to CloudPlatform.

Page 17: Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

Demonstrators

17

COMPRISE demonstrators Provides

Smart Consumer Apps• Cooking support app for smart assistants• Presentation control app for smart glasses• Note taking app for smartphones

E-commerce • Drive-thru demonstrator platform

E-health• Medical record app for medical staff• Registration desk app for patients

Page 18: Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

H2020 COMPRISE - Cost effective, Multilingual, Privacy-driven voice-enabled Services

www.compriseh2020.eu

Thank you for your attention

Questions?

@compriseh2020

https://www.linkedin.com/company/comprise-h2020

Page 19: Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

Voice interaction today: training branch

19

Page 20: Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

Work packages

20

WP1Project management

WP3 Multilingual

personalized voice interaction

WP2Privacy-driven voice

interaction

WP5Cloud-based platform for

multilingual voice interaction

WP4Cost-effective multilingual

voice interaction WP6Evaluation and demonstration

for practicaluse cases

WP7 Communication,

dissemination and exploitation

WP8 Ethics

Page 21: Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

COMPRISE voice interaction system

21

Page 22: Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

Key new features

22

COMPRISE features Initial TRL Final TRL

Privacy-driven speech transform 3 4

Privacy-driven text transform 3 4

Weakly supervised learning algorithm

3 4

COMPRISE SDK 4 6

Cloud-based platform 4 6

Page 23: Showcasing Language Technology in H2020 and INEA projects...Store the user’s speech in the cloud (> 1,000 h & 10,000 dialogs) Hire humans to: o Manually transcribe speech as a sequence

Research & Innovation KPIs

23

KPI-RI-1 Release of COMPRISE SDK

KPI-RI-2 Formal/empirical privacy guarantees

KPI-RI-3 Support of at least 6 languages

KPI-RI-4 Interact with system in another language

KPI-RI-5 40% cost reduction

KPI-RI-6 Exploitation plan incl. 3-year maintenance

KPI-RI-7 Successful demonstration