5 major challenges and solutions for enterprise-level implementation of large models

Click to view the complete event review: https://my.oschina.net/u/4489239/blog/11105657

Click to jump to the preview of Shenzhen Yuanchuang Conference on May 18: https://www.oschina.net/event/2332004

On April 20, the 102nd Yuanchuang Conference was successfully held in Wuhan. This issue invites artificial intelligence experts from Wuhan Artificial Intelligence Research Institute, Huawei, MindSpore, JD Cloud, and Gitee AI to give speeches on the theme of [Large Model Competition and Performance Optimization]. Yuan Lijiang, product director of JD Cloud, delivered a keynote speech on "Inspiring the Future with Intelligence - Yanxi Large Model Platform". Yuan Lijiang introduced that there are five major challenges in the enterprise-level implementation of large models: real-time, explainability, security and controllability, complex decision-making, and professionalism. The key to implementation is how to make correct decisions in real time and in an uncertain and dynamically changing environment. implement.
 
Yuan Lijiang introduced that there are two main ways to implement large models. One is the Copilot model. The interaction relationship is human-led. AI only serves as an assistant. In some scenarios, AI completes the work, such as text content generation and processing. , Vincent Tu, etc. In fact, for enterprises, they need to release manpower as much as possible. The other is the Agent mode, which is more suitable for complex scenarios in enterprises. In this mode, humans stand from a higher-dimensional perspective and act as the "mentor" or "coach" of artificial intelligence, setting goals and supervising the results. The large model can exert its reasoning ability, use appropriate tools and excuses, and finally give corresponding result feedback.
 
The main technologies relied on for the implementation of large models in enterprises have also changed. The initial Pre-train has the highest cost and huge investment; later, the cost of SFT mode decreased but the implementation effect was not good; the retrieval based on vector database enhanced RAG mode, but the effect was improved. It can only be limited to knowledge question and answer scenarios; in the end, proficient technical teams pay more attention to the Agent mode and can achieve multi-scenario support.
 
In JD.com's financial business, it is difficult to improve the ability of large models to solve practical problems simply by relying on large model SFT or LoRA. Instead, it is based on Agent technology to enable machines to use tools to solve business problems. Specifically, it uses the Agent to understand the user goals, disassemble each sub-task, and select appropriate tools for each sub-task. These tools are some interfaces of JD.com’s original business, and finally combined with large model capabilities to provide feedback. In this way, answers to some users' complex questions will be more accurate.
 
At present, JD Yanxi’s full model platform has built a multi-layered product matrix. The lowest layer is resource support, including computing resources, storage resources, high-speed network and resource scheduling. In the model resource layer, it provides capabilities such as model management and training, data set processing, and model evaluation and deployment. Above the model resource layer is the construction of intelligent agents, focusing on the integration of various tools. The top layer is the application service layer, which adapts to multiple enterprise scenarios.
 
JD Yanxi's large model platform has 6 major functions: resource scheduling collaboration, which can realize efficient management and scheduling of computing resources, ensuring performance optimization and cost control of large model development and application; data management, which provides management and support for large model training Pre-training, fine-tuning, reinforcement learning, evaluation, etc. are carried out efficiently; model training, training and fine-tuning through large models allows enterprises to have customized models to improve accuracy and relevance; intelligent agent construction helps enterprises create and deploy intelligent agents, Combined with the enterprise's existing IT systems to perform complex tasks; security compliance ensures that all large-model applications comply with security standards and legal and regulatory requirements; the intelligent application market provides a series of pre-built large-model applications that enterprises can deploy directly or provide plug-ins Quick access to the system.
 
Scan the QR code to watch the replay of the speech "Inspiring the Future——Yanxi Large Model Platform"⬇️
High school students create their own open source programming language as a coming-of-age ceremony - sharp comments from netizens: Relying on the defense, Apple released the M4 chip RustDesk. Domestic services were suspended due to rampant fraud. Yunfeng resigned from Alibaba. In the future, he plans to produce an independent game on Windows platform Taobao (taobao.com) Restart web version optimization work, programmers’ destination, Visual Studio Code 1.89 releases Java 17, the most commonly used Java LTS version, Windows 10 has a market share of 70%, Windows 11 continues to decline Open Source Daily | Google supports Hongmeng to take over; open source Rabbit R1; Docker supported Android phones; Microsoft’s anxiety and ambitions; Haier Electric has shut down the open platform
{{o.name}}
{{m.name}}

Guess you like

Origin my.oschina.net/u/4489239/blog/11105666