Frequently asked questions – multilingual avatars
What is Guildhawk Voice?
Guildhawk Voice is an AI video offering that allows you to create dynamic video content using an AI face and voice, based just on written materials. Polished videos can be created in hours, with no need for actors, physical space, or equipment, and can be hyper-localised through use of specific accents, a diverse range of avatars, and translation.
Guildhawk Voice allows you to present information in a different, more engaging way. And, if you want to make your videos highly personalised to your company and brand, we can even create avatars based on specific individuals, complete with case-by-case recording of an individual’s voice.
How exactly does Guildhawk Voice work?
Guildhawk Voice uses sophisticated text-to-speech technology and video synthesis to turn written material into content spoken by a synthetic face and voice, with lip movements to match.
This can be combined with quality translation to produce videos in multiple languages, accents and tones. See below for more information on the languages currently available.
Each video can have branded image or video backgrounds (including PPTs), and can even include background music, depending on what you need your content to achieve.
Is there human input, or does the AI work independently?
Like all Guildhawk processes, both elements work together to create the finished product. Where required, translation of materials, whether that’s text-to-speech content or any on-screen wording, is carried out by our linguists, using our other AI tools where appropriate.
Guildhawk’s in-house team will take the time to select culturally appropriate avatars and voices, will input the relevant text and translations, and will take care of any post-production and editing.
Avatars, synthetic voices, text-to-speech functionality and video synthesis are all handled by the AI.
What kind of content do we need to provide for Guildhawk Voice to work?
In order to create a simple video, all we need from you is text content – it’s really that simple!
For slightly more elaborate videos, if you have any specific images or visual content you would like to use as a background, you will need to provide these. The ideal resolution is 1920x1080 pixels. Should you want a PPT running in the background, it will need to be exported as a video of 50 MB or less.
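If you would like to check your background assets before sending them over, the two figures above can be tested with a short script. The following is only a minimal sketch in Python, not a Guildhawk tool: the function names are hypothetical, and it assumes the 50 MB limit means decimal megabytes (50,000,000 bytes), which is the stricter reading.

```python
import os

# Ideal background resolution, taken from the answer above.
IDEAL_RESOLUTION = (1920, 1080)

# Assumption: the 50 MB limit is read as decimal megabytes (the stricter reading).
MAX_VIDEO_BYTES = 50 * 1000 * 1000


def is_ideal_resolution(width: int, height: int) -> bool:
    """Return True if an image matches the ideal 1920x1080 resolution."""
    return (width, height) == IDEAL_RESOLUTION


def is_video_small_enough(size_bytes: int) -> bool:
    """Return True if an exported PPT video is non-empty and within the size limit."""
    return 0 < size_bytes <= MAX_VIDEO_BYTES


def check_video_file(path: str) -> bool:
    """Hypothetical helper: check the size of an exported video file on disk."""
    return is_video_small_enough(os.path.getsize(path))
```

For example, `check_video_file("slides.mp4")` would return False for an export larger than the limit, which is a sign the presentation needs to be exported at a lower quality or split into shorter videos.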
Ideally, you would also tell us who you want to target with your video(s), so we can ensure your avatar speaks in a language, accent and tone that will appeal directly to your audience.
What does Guildhawk Voice cost and how quickly can you produce my videos?
Costs for Guildhawk Voice start at £100, and short videos can be turned around in 1-2 working days.
How does Guildhawk Voice integrate with our existing systems?
We can use your existing visual collateral to create video backgrounds, and can leverage existing written material to create dynamic spoken content. This means you can produce entire videos without actually having to create any new content! We can also integrate Guildhawk Voice into larger deliveries, providing content as a combination of standard translation deliverables and videos, depending on what you need your material to do.
If you would like to incorporate Guildhawk Voice into your existing Learning Management System, this is easily done. Final videos are provided as MP4s, which can simply be uploaded to your LMS. We can even help you create and add elements such as checklists, tests and quizzes to accompany the videos.
Which languages does Guildhawk Voice support?
Guildhawk Voice currently supports 39 languages, as well as multiple accents and language variants, e.g. Portuguese for Portugal and for Brazil.
Is my data confidential?
Guildhawk maintains UKAS-accredited ISO 27001 certification, which is focused on information security and includes processes for protecting the integrity and confidentiality of data. Any content that you share with us as part of the Guildhawk Voice offering will be handled according to these secure information management practices.
How do the multilingual avatars used in Guildhawk Voice work?
Guildhawk Voice’s AI avatars use a combination of natural language processing techniques and a text-to-speech (TTS) engine to convert text into natural-sounding speech across multiple languages, accents and dialects.
The avatars’ facial expressions are created using a deep neural network, which ensures facial expressions are in sync with the synthesised speech generated by the TTS engine.
What are the top benefits of using an avatar to communicate information?
Anywhere, anytime – Every business has moments where a reaction to the market is required immediately. A digital avatar presents the information a business needs to convey in an engaging, professional manner, within minutes.
Trust – Presenting information using a custom avatar creates a truly personalised experience between a business and its customers. Digital avatars represent a business in a way that’s credible and professionally engaging.
Time and cost savings – With avatars, personalised services can be created quickly and at a fraction of the cost. Avatars remove the need to hire actors, a design team and a film crew, which reduces human workload and operating costs and speeds up delivery.
Engagement – Presenting information in a digestible format such as a video improves engagement by as much as 4x when compared to the same content in written form.
Enhanced corporate training programmes – Avatars allow HR departments to produce corporate training programmes at a fraction of the cost, while allowing employees to learn anywhere and across a large number of languages. As of 2019, LinkedIn states that 59% of corporate training budgets are spent on online courses. Digital avatars significantly reduce the cost of creating these training programmes.
Ageing populations – As the number of people aged 60 and over is estimated to double by 2050, an increasing number of senior citizens are looking to the Internet for personal enrichment and education. Avatars make digital content more accessible and create a simplified, engaging and human-like user experience.
Global market penetration – Artificial intelligence avatars extend digital content to a wider population, as information is presented in a variety of languages, accents and dialects. Clear synthesised speech also improves engagement among people who have literacy or learning difficulties.
Why use a multilingual avatar instead of a real recording of an actor?
Creating digital content with a multilingual avatar has many benefits over using a real actor. The most obvious is the ease with which you can create videos – if you’re leveraging existing written content, you don’t even have to produce a script! This gives businesses greater flexibility to react in near real-time to events or communicate urgent information to their customers or employees.
Digital content presented by a multilingual avatar also reduces costs dramatically when compared to creating content with a real actor. Costs can quickly start to escalate when producing even a simple traditional video – you’ve got fees for an actor (or multiple actors, if your video is multilingual), studio, film crew, makeup artists, etc. Digital avatars incur none of these costs, meaning huge savings for the content creator.
Artificial intelligence also enables a single avatar to speak a wide range of languages, including local accents and dialects, reducing the communication and cultural barriers businesses often face in local markets.