Vientiane in the world always needs a "port" to enter

37ccb4451eb8f93ebd9118e8ea20c1d2.png

Want to dive into the new world of Vientiane,

Cooler all-purpose "terminal" development tools are needed.

Cloud 

Imagine

Video, digitization, and intelligence are reborn in the world, but to enter this wave of digitalization dominated by audio and video, a universal terminal is needed.

Currently, terminal performance challenges are intensifying, and terminal-side computing power encounters bottlenecks... Is there a possibility for all enterprises to withdraw from complex audio and video development projects and return to the business itself?

How to obtain one-stop audio and video services? How to simplify the lengthy and cumbersome SDK access process? Can you still take the initiative while lowering the development threshold?

Will scene intelligence be the next trend?

How to release more "digital productivity" by building audio and video technology capabilities for scenarios?

To deal with diversification, how can the audio and video terminal suite meet the "thousands of lines"?

This article was planned and interviewed by IMMENSE, Hong Bingfeng, the person in charge of the media service application side of "Alibaba Cloud Video Cloud", and LiveVideoStack.

03a6457389d87f21f4d11b1d35769c14.png

01

Riding on the digital wave

What are we talking about when we talk about digitization of industry?

In the past few years, the digital construction of the industry has been in full swing.

According to authoritative data, 65% of industry digital information comes from video, and 5% of audio information is also included. It has to be said that audio and video technology is very important to the digitalization of the industry, and the digitalization of audio and video is also the leading state of the digitalization of the industry .

It is obvious that the audio and video technology carried by cloud computing is accelerating its penetration into all walks of life. From the consumer Internet to traditional industries such as education, medical care, finance, and retail, the industrial Internet has set off a wave of digital audio and video. Remote proctoring, telemedicine, etc. , Manufacturing monitoring...Video has long been a trend in Vientiane.

More importantly, the emergence of these new scenes is constantly spawning new demands for audio and video capabilities. Obviously, the digitization of audio and video in the industry no longer relies solely on a few individual PaaS atomic capabilities, but requires an organic combination of audio and video capabilities .

Therefore, scene-based audio and video solutions are imminent .

We found that with the deepening of audio and video application scenarios, the demand for "multiple SDKs" has gradually become "standard configuration".

Often an audio and video scene requires different SDKs to work together, such as live streaming streaming + playback, short video shooting + playback, live streaming + RTC real-time interaction, etc. However, the development cycle of several months often discourages many companies.

Of course, it's not just a pain point in the development cycle. Therefore, when we talk about the digitization of audio and video in the industry, we are talking about the breakthrough of many pain points .

b969d6c0a510103b04b56426bbbc5a2d.png

Audio and video development, need a set of powerful "combination boxing"?

In the face of huge audio and video development projects, the pain points of the enterprise's landing are obvious:

The threshold for audio and video development is high.

Audio and video seem to be common, but it is a highly professional field, especially in the development of the terminal side requires rich technical experience, and audio and video talents are in short supply, and most enterprises do not have such reserves.

Multiple SDK access is complicated.

Each SDK needs to be docked and authorized separately, and the atomization of SDK access capabilities further increases the difficulty of multi-SDK mutual adaptation.

It is difficult to be compatible with a large number of devices.

In the process of industry videoization, more "lightweight" requirements follow, and terminal forms such as Web/small programs are becoming more and more critical in many scenarios. How to ensure the SDK compatibility of a large number of multi-terminal devices has become more and more difficult.

Based on these industry trends and challenges, the audio and video demands of enterprises can be summarized as: multi-SDK combinations and solutions with high ease of use, high performance, scenario-based, and multi-terminal.

Facing the wave of audio and video digitalization, and wanting to solve all the pain points, a set of Media "combination boxing" needs to be introduced urgently, so that more companies can get out of the "audio and video development dilemma" and refocus their attention on their own business logic.

b471d5c4f801d07d596db6d76d6ec4bb.png

02

Redefine "high ease of use"

MediaBox, a treasure chest for audio and video development?

In order to completely solve the pain points of audio and video development, Alibaba Cloud Video Cloud launched the MediaBox audio and video terminal integration kit, as a powerful tool for media development, which can help enterprises accelerate the process of audio and video digitization.

MediaBox, as the name suggests, is an all-encompassing audio and video magic box . The core includes an integrated audio and video terminal SDK and a series of AUI Kits low-code solutions for different scenarios.

MediaBox audio and video terminal SDK , with a unified technical base, allows all audio and video SDKs to be deeply integrated under the integrated architecture, realizing completely free and flexible combination, while reducing the size to the extreme. At the same time, open up the multi-terminal underlying architecture, realize multi-terminal integration, and realize the multiplexing of a set of codes. Currently, it has covered iOS, Android, Web, Win, Mac and other terminals.

MediaBox low-code development AUI Kits is a low-code integration method including UI. On the basis of the SDK, AUI Kits encapsulates the scenario-based UI implementation, and realizes the linkage between App Server and UI, and builds an end-to-end scenario-based solution as a whole.

Relying on the powerful PaaS cloud service and underlying network technology of Alibaba Cloud Video Cloud, MediaBox is like a treasure chest, which can take audio and video capabilities at will and combine them freely, and the size is as thin as a cicada's wing, which can easily cope with the digitalization of audio and video in the industry. of various complications.

9719a8d15c43303ba61b416171f0eeca.png

Who will take the initiative in development?

The ease of use, convenience, and efficiency of the audio and video terminal suite are the foundation. MediaBox has refreshed the definition of "high ease of use" through the flexible combination of SDK, low-code access of AUI Kits, open source and open source, and support for secondary development .

➤ SDK is simple and flexible

MediaBox provides more than 15 kinds of SDK combinations. According to different application scenarios, the SDK with corresponding audio and video capabilities can be freely selected, and it only needs to be connected once, and it can be used after one license authorization, which greatly simplifies the SDK access process.

➤ AUI Kits hourly online

As an end-to-end solution for audio and video services, AUI Kits encapsulates multiple SDKs and cloud PaaS capabilities in a scene-based manner, and packages and outputs scene function components and a relatively complete UI implementation.

Quick access and running through the "low-code" method can shorten the integration time at the monthly/weekly level to the hourly level, greatly reducing the access cost of the enterprise. Enterprises do not need to care about the complex logic and best practices of the audio and video SDK, but can focus more on their own business implementation.

➤ AUI Kits open source and open, personalized customization

In addition to supporting agile development, AUI Kits is also a new upgrade on the original low-code audio and video factory:

Provide open source and open UI and App Server source code, allow customers to develop secondary, customize brand logo and visual style, and realize personalized business customization, so that enterprises can still grasp the development initiative while lowering the development threshold and shortening the development cycle .

Compared with the faster pursuit of low-code audio and video factories in the past, MediaBox focuses on a high degree of flexibility and ease of use.

It is worth mentioning that the AUI Kits solution is currently free. Enterprises only need to pay for the PaaS capability to have an access experience close to SaaS and enjoy the low-cost advantages of PaaS.

"Easy to use" audio and video development tools are being redefined. In a flexible, fast, agile, personalized, and low-cost way, MediaBox helps enterprises acquire audio and video capabilities in one-stop fashion.

45e3d06742f6539bf67a11e87e5f9927.png

03

More than just "tools"

On top of "high ease of use", what is another mission of the tool?

Audio and video development tools must not only ensure "high ease of use" before access, but also satisfy "extreme ease of use" after access.

Enterprises' expectations for audio and video are high fluency, low latency, ultra-high definition, strong stability, and low cost.

Based on this, MediaBox continuously optimizes the basic performance and core indicators of audio and video terminals with a highly available stability system, a unified data index system, and a complete automated test system to provide customers with the ultimate experience.

In order to ensure efficient online operation and maintenance, MediaBox has also built an end-to-end full-link troubleshooting tool. Through intelligent analysis, it can quickly locate the link node where the problem occurs, and discover, troubleshoot and solve the problem faster.

At the same time, the deep integration of the cloud and the terminal has brought the "extreme ease of use" of audio and video to a higher level.

Combining the underlying network, AI technology, and cloud processing capabilities, Alibaba Cloud Video Cloud has created a cloud-integrated, end-to-end, and full-link overall solution to meet the needs of different customers in audio and video scenarios.

Just like the "strong alliance" between MediaBox and MediaUni, a multi-converged streaming media transmission network, it can provide customers with ordinary live broadcast in 5-6s, ultra-low-latency live RTS with a delay of less than 1s, and meta-rendering service support in 60ms , different end-to-end delay options to meet the diverse business needs of enterprises.

b64536cde55909baec1044d88afc5034.png

Tools with "scenario intelligence" are the future?

With the development of large AI models, it will become an inevitable trend for some lightweight models to run on terminals. AI models will derive more terminal intelligence capabilities during the process of industrialization .

In the scenario-based practice, MediaBox is also based on end-to-end intelligent technology, constantly innovating and breaking through.

For example, in the player SDK, intelligent preloading will use intelligent algorithms to dynamically control the size of preloading cache and memory cache based on information such as current network conditions, user sliding behavior, and historical playback behavior, which can save preloading traffic and improve preloading. The efficiency of content usage achieves the ultimate balance between cost and experience.

With the deepening of more scenes, MediaBox will evolve more scene intelligence capabilities.

For example, in the one-to-many scenario of distance teaching, the decrease of students' concentration leads to poor teaching effect, which is an eternal pain point of distance teaching.

In this context, MediaBox launched an intelligent detection SDK for concentration, which can detect changes in students' status in real time, and feedback students' concentration to teachers, helping teachers to perceive students' class status in a timely manner and improve the overall teaching effect.

Scenario intelligence brings more possibilities for business empowerment. The audio and video terminal kit is not only a simple development tool, but also an innovative port of the industry, which endows the scene with brand-new digital intelligence capabilities in the lightest way.

c5a41e990f843240ea59441888541e2a.png

04

Vientiane world, a "device" in the lead

Development tools can meet the "thousands of thousands of faces"?

Behind the rapid progress of "industry digitalization" is a deep understanding of the industry scene.

Looking back at the development history of audio and video technology, audio and video have developed and grown in the interactive entertainment industry, the scene is relatively simple and mature, and the requirements for audio and video capabilities are relatively general.

When audio and video penetrate into more traditional industries, because traditional industries are composed of different scenes, each scene has different characteristics and has obvious industry attributes . Video capabilities can better meet the digital needs of the industry.

At present, Alibaba Cloud Video Cloud has launched the MediaBox multi-scenario AUI Kits solution and multiple SDKs for different industry scenarios, including entertainment live broadcast , e-commerce live broadcast, and enterprise live broadcast in live broadcast scenarios, remote proctoring in interactive scenarios , interactive classrooms, and language in communication scenarios . Chat rooms, KTV, and short and long videos of on-demand scenes .

➤ In the education industry, the interactive classroom AUI Kit solution for remote teaching scenarios supports intelligent real-time detection of student concentration, 10,000+ students interact with the whiteboard in real time, 50+ real-time mics, and 100,000+ students watch in real time, meeting the needs of large classes , Open class and other scene requirements.

➤ In the retail industry, major retailers are trying to build private domain traffic pools, or build their own APPs for live streaming. The e-commerce live broadcast AUI Kit solution provides rich interactive live broadcast functions and supports multiple companies to quickly build live e-commerce business from 0 to 1.

➤ In the automotive industry, the new car release scene has attracted wide attention. The AUI Kit solution for enterprise live broadcast helps enterprises quickly build live broadcast room functions, create a live broadcast of blockbuster new car launches for global car enthusiasts, and guarantee high-quality playback experience under hundreds of thousands of concurrency .

➤ In the digital reading industry, in addition to traditional text reading, it has become a new trend to convert text scripts into short plays. Short video AUI Kit, based on the ability demands of on-demand scenes, designs and implements a one-stop short video production and playback solution .

➤ In immersive scenarios, the VR panoramic playback SDK uses FOV to transmit audio and video data, which can improve fluency while reducing bandwidth costs, and combine spatial audio to achieve the ultimate immersive audio and video experience.

It can be seen that the super energy of MediaBox is being released to many scenarios, and more industries and scenarios are in need of such a "sharp tool" for accelerating audio and video digitization to open up new opportunities and spaces.

d616655e13247dbd598dd022f487daac.png

Is the art test on the cloud the epitome of education digitization?

Alibaba Cloud's video cloud remote intelligent proctoring solution can be used as a microcosm of the effective exploration of "audio and video digitization" in the education industry.

As the "art test fever" continues to heat up, organizing large-scale offline exams not only requires a lot of manpower and material resources, but also requires candidates to bear the time and economic costs of long-distance offline exams. School, which exacerbates the burden. But "offline" is necessary for the special type of "art test".

MediaBox's scene solution saves all art candidates from having to deal with such suffering.

Through the AUI Kit solution for remote proctoring, Alibaba Cloud Video Cloud has cooperated with ecological partners to build a remote proctoring platform, which successfully supported the "Cloud Art Examination" for undergraduates at the China Academy of Art this year, and ensured that 40,000+ candidates at home and abroad can successfully complete the online art exam.

Quickly integrated in a low-code way, the remote proctor AUI Kit solution provides open source components and architecture design guidelines for the proctor and examinee, greatly reducing the access threshold. In terms of device-side coverage, it covers iOS/Andriod, web pages, DingTalk applications and WeChat applets to ensure "high ease of use" in remote proctoring scenarios.

Based on the underlying network of 3200+ nodes around the world and powerful media processing capabilities, the remote proctoring platform can carry 100,000+ candidates online at the same time, realizing the video delay of the proctor end within 1.5 seconds and the delay of 1-to-1 calls within 400ms, all-round satisfaction "Extremely easy to use" with high reliability, high concurrency, low latency, and high definition.

At the same time, in such a cloud examination scene, a new "scenario intelligence" has also emerged, and Alibaba Cloud Video Cloud has developed and launched an intelligent anti-cheating SDK.

Compared with the traditional anti-cheating, the video screenshots are analyzed in the cloud, which requires a large amount of analysis, takes a long time, and costs a lot. The intelligent anti-cheating SDK is a real-time detection on the terminal side, including human behavior detection, electronic product detection, clothing detection, environmental detection, etc., with fast reporting, faster speed and lower cost. Currently, it has covered multiple terminals such as Android/iOS/Web equipment.

Art exams are different from other online computer-based exams. It is required to include the entire drawing board and the candidate's side face in the host screen, but the algorithm tuning caused by different camera positions will be more complicated. The intelligent anti-cheating SDK provides a variety of end-side real-time detection capabilities in the form of atomic access, and can be dynamically enabled and flexibly selected according to the needs of different test scenarios, and customized to meet many types of online test scenarios .

It is precisely because of the in-depth understanding of the audio and video scenes in the industry that we can innovate to solve the pain points of the industry and open up new space for the scene.

MediaBox's ability to move towards broader industry scenarios and explore deeper scenarios is inseparable from the co-creation with industry ecological partners in the future. This LiveVideoStackCon, Alibaba Cloud Video Cloud will also release a new ecological cooperation plan, looking forward to joining hands with more ecological partners to open up the Vientiane world of audio and video digitization in the industry.

f61a32cb279612d2e99b3ad408c5f002.png

As an acceleration "weapon",

How does MediaBox realize the new upgrade of audio and video digitization in the industry?

July 28 afternoon

LiveVideoStackCon2023 Shanghai Station

Alibaba Cloud Video Cloud Session

A senior technical expert of Alibaba Cloud Intelligence gave a speech

"MediaBox: Re-acceleration of Industry Audio and Video Digitization"

Unleash the "digital productivity" of audio and video scenes!

2db1b21bee2ec2195b8b61d922fd8585.png

239663a413a27b0bfec3ab4bb76044cc.png

⬆️ Scan the QR code above to register for the session

Click to read the original text and make an appointment now

Guess you like

Origin blog.csdn.net/vn9PLgZvnPs1522s82g/article/details/131907494