top of page

Multilingual Chatbots supporting 50+ Languages, with Future Growth Hinged on Synthetic Data Innovations

Digipal Media Mgr


ChatGPT is a versatile multilingual chatbot and currently supports over 50 languages! 🌍💬 This includes Chinese, Japanese, Spanish, French, German, Russian, Arabic, Portuguese, Italian, and more. Large language models (LLMs) excel particularly in languages with extensive training data, encompassing diverse linguistic structures and idioms.


Substantial amounts of well-structured training data, such as example translations, are key for achieving high-quality results. The heat map derived from OPUS parallel corpora indicates the translation quality that can be expected across different languages. Obviously, there are quite some gaps.


Based on observations, data requirements increased about tenfold for every new model generation. What needs to happen for the models’ capabilities to evolve further?


On the assumption that commercial model makers won’t train on private data, then future models must rely heavily on synthetic data or some other new ideas will be required.

 
 
 

Comments


In the present digital age, where technology is integral to banking operations, ensuring data security has become crucial. At Digipal, even though we don't handle client identifying data (CID), we treat portfolio information as highly sensitive data and we have implemented stringent measures to ensure confidentiality. Our servers comply with industry standards for data security. Additionally, our APIs employ state-of-the-art encryption technology during data exchange. Furthermore, when designing our platform's workflows, we took care to ensure that portfolio data is not shared with other online service providers. Our commitment to data security remains unwavering.

©2023 by Digipal AG

bottom of page