Upstage's LLM Tops Global Open-Source AI Ranking


7/19/2023
  • Upstage ranked #1 in the HuggingFace Open LLM Leaderboard

  • Upstage 30B LLM beat larger models of tech giants, solidifying its leadership in the private AI market

  • The top-ranked model offers an optimal alternative to ChatGPT with better capaibility to mitigate security and AI hallunications risk


 

(Seoul, Jul. 19, 2023 /Upstage) Upstage, South Korea’s leading AI startup, has released the world's most advanced open-source generative AI model.

Upstage announced on July 19 that the company’s 30-billion-parameter Large Language Model (LLM) secured the top position on HuggingFace's Open LLM Leaderboard.

Hugging Face, with its mission to advance and democratize artificial intelligence through open source and open science, offers an evaluation tool that ranks and evaluates open-source LLMs and chatbots. Their LLM leaderboard boasts over 300 models tailored for various languages and purposes, comparable in capabilities to OpenAI's ChatGPT and Google's Bard, where numerous models being submitted each day. Evaluation is based on the average of four key metrics, such as the model’s reasoning challenge, common sense inference, contextual understanding, and factual accuracy.

LLM has garnered significant attention as the pinnacle of generative AI with its capacity to train on a plethora of text data and perform NLP (natural language processing)-based tasks. Major tech companies, including OpenAI's ChatGPT, Google's Bard, and Meta's LLaMa, have already jumped into the fierce competition in this field, while open-source LLMs are also making remarkable progress with models under 100 billion parameters via open-source platforms such as HuggingFace.

Upstage submitted its 30B LLM earlier this month to the leaderboard for evaluation, which quickly soared to the top of the list with an average score of 64.7, following closely behind Meta's LLaMa 2 70B model released on the same day. This achievement marks another milestone for a local startup managing to produce a competitive language model with just half the parameters of the global tech giant.

In a crucial metric evaluating the model's performance and its ability to mitigate AI hallucinations, Upstage scored an impressive 56.5, surpassing Meta's score of 52.8. Notably, Upstage outperformed established AI/LLM leaders such as Meta and Microsoft, as well as Stability AI and Databricks. Moreover, it even exceeded the performance of UAE's Falcon 40B and Databricks' Mosaic ML by approximately ten percent.

Upstage developed the top-ranked model in just two months, once again showcasing their well-established technological prowess in the field. The company's team of experts capitalized on their experience in creating a Korean language NLU (natural language understanding) dataset called "KLUE" and utilized prompt engineering and fine-tuning techniques gained from operating "AskUp," Korea's most popular generative AI application.

The Upstage 30B model's domination in the open-source LLM ranking marks a momentous turning point in Upstage's path towards establishing itself as a frontrunner in the rising "private AI" market worldwide.

Upstage 30B model highlights the potential of a compact and efficient LLM as a cost-effective alternative for companies looking to integrate generative AI into their business operations. While there is a growing demand for deploying state-of-the-art language models, many companies are reluctant to rely solely on ChatGPT or Bard for all their operations due to the inherent risk of data leakage.

To tackle this issue, private AI solutions are emerging as a new chapter in the generative AI revolution. These models are trained specifically on the company's private data, guaranteeing the security of sensitive information and reducing the possibility of producing inaccurate or misleading outputs. Industry bellwethers such as Apple, Walmart, Amazon, and JP Morgan have already banned their employees from using ChatGPT. Samsung Electronics is also taking steps to introduce its in-house language model to address security considerations.

With Upstage's small-but-powerful LLM, companies and institutions now have access to an advanced AI model without the worry of information leakage. This presents an opportunity for them to develop internal AI solutions by training the model using their own internal data and protocols. This enables employees to easily access and benefit from the company's knowledge management system. Another potential application is utilizing the AI solution to analyze sales data and create optimal marketing strategies.

In order to further enhance the performance of its model and meet the growing demands of domestic enterprises across various industries, Upstage has plans to train its model using an additional set of Korean language data. This proactive approach demonstrates Upstage's commitment to continuous improvement and its dedication to meeting the evolving needs of its clients in the local market.

Sung Kim, CEO of Upstage, said, "I am thrilled that Upstage’s AI capabilities dominated the global ranking.” He added, “Moving forward, Upstage will not only focus on enhancing the performance of the Korean language, but also strive to become a global leader in the emerging private AI market for businesses.”

 
 
 

※ Photo description: Startup Upstage, which operates AskUp, a representative generative AI service in Korea, has created the world's best generative AI model. The photo shows an Upstage model taking first place on the Hugging Face Open LLM Leaderboard.

 
 
  • Upstage | Keunkyo Kim, PR Director | keunkyo@upstage.ai Upstage | Sungbeom Bae, PR Manager | sungbae@upstage.ai

    Download press release

  • Upstage, founded in October 2020, offers a no-code/low-code solution called "Upstage AI Pack" to help clients innovate in AI. This solution applies the latest AI technologies to various industries in a customized manner. Upstage AI Pack includes OCR technology that extracts desired information from images, recommendation technology that considers customer information and product/service features, and natural language processing search technology that enables meaning-based search. By using the Upstage AI Pack, companies can easily utilize data processing, AI modeling, and metric management. They can also receive support for continuous updates, allowing them to use the latest AI technologies conveniently. Additionally, Upstage offers practical, AI-experienced training and a strong foundation in AI through an education content business. This helps cultivate differentiated professionals who can immediately contribute to AI business.

    Led by top talents from global tech giants like Google, Apple, Amazon, Nvidia, Meta, and Naver, Upstage has established itself as a unique AI technology leader. The company has presented excellent papers at world-renowned AI conferences, such as NeurIPS, ICLR, CVPR, ECCV, WWW, CHI, and WSDM. In addition, Upstage is the only Korean company to have won double-digit gold medals in Kaggle competitions. CEO Sung Kim, an associate professor at Hong Kong University of Science and Technology, is a world-class AI guru who has received the ACM Sigsoft Distinguished Paper Award four times for his research on bug prediction and automatic source code generation. He is also well-known as a lecturer for "Deep Learning for Everyone," which has recorded over 7 million views on YouTube. Co-founders include CTO Hwal-suk Lee, who led Naver's Visual AI/OCR and achieved global success, and CSO Eun-jeong Park, who led the modelling of the world's best translation tool, Papago.

 
Previous
Previous

Upstage's 70B Language Model Outperforms GPT-3.5, Becomes Global No.1

Next
Next

Upstage Represents Korean AI Industry at Google-MSIT's AI for Korea 2023