New York Tech Media
  • News
  • FinTech
  • AI & Robotics
  • Cybersecurity
  • Startups & Leaders
  • Venture Capital
No Result
View All Result
  • News
  • FinTech
  • AI & Robotics
  • Cybersecurity
  • Startups & Leaders
  • Venture Capital
No Result
View All Result
New York Tech Media
No Result
View All Result
Home AI & Robotics

A desert robot depicts AI’s vast opportunities

New York Tech Editorial Team by New York Tech Editorial Team
December 27, 2021
in AI & Robotics
0
A desert robot depicts AI’s vast opportunities
Share on FacebookShare on Twitter

When Hongzhi Gao was young, he lived with his family in Gansu, a province located in the center of northern China by the Tengger Desert. Thinking back to his childhood, he recalls the constant, steady wind of dirt outside their house, and that during most months of the year it didn’t take more than a minute after stepping outside before sand would fill any empty space and creep into his pockets, boots, and his mouth. The monotony of the desert stuck in his head for years, and at university he turned that memory into an idea to build a machine that can bring plant life to the desert landscape.

Efforts to stop desertification—the process by which fertile land becomes desert—have been primarily focused on expensive manual solutions. Hongzhi designed a robot with deep learning technology to automate the process of tree planting: from identifying optimal spots to planting tree seedlings to watering. Despite having no experience with AI, as an undergraduate student Hongzhi used Baidu’s deep learning platform PaddlePaddle to stitch together different modules to build a robot with better object detection capability than similar machines already available in the market. It took less than one year for Hongzhi and his friends to spin up the final product and put it to work.

Hongzhi’s desert robot serves as a telling example of the increasing accessibility of artificial intelligence.

Today, more than four million developers are using Baidu’s open source AI technology to build solutions that can improve the lives of people in their communities, and many of them have little to no technical expertise in the field. “Within the next decade, AI will be the source of changes taking place across every fabric of our society, transforming how industries and businesses operate. The technology will expand the human experience by taking us on a deeper dive into the digital world,” said Baidu CEO Robin Li at Baidu Create 2021, an AI developer conference.

As we enter a new chapter in the evolution of AI, Haifeng Wang, CTO of Baidu, identified two key trends that underpin the industry’s path forward: AI will continue to mature and increase its technical complexity. And at the same time, the cost of deployment and barrier to entry will decrease—benefiting both enterprises building AI-powered solutions at scale and software developers exploring the world of AI.

Merging of knowledge and data with deep learning

The integration of knowledge and data with deep learning has significantly improved the efficiency and accuracy of AI models. Since 2011, Baidu’s AI infrastructure has been acquiring and integrating new information into a large-scale knowledge graph. Currently, this knowledge graph has more than 550 billion facts, covering all aspects of everyday life, as well as industry-specific topics, including manufacturing, pharmaceuticals, law, financial services, technology, and media and entertainment.

This knowledge graph and the massive data points together make up the building blocks of Baidu’s newly released pre-trained language model PCL-BAIDU Wenxin (version ERINIE 3.0 Titan). The model outperforms other language models without knowledge graphs on 60 natural language processing (NLP) tasks, including reading comprehension, text classification, and semantic similarity.

Learnings across modalities

Cross-modal learning is a new area of AI research that seeks to improve machines’ cognitive understanding and to better mimic the adaptive behavior of humans. Examples of research efforts in this area include automatic text-to-image synthesis, where a model is trained to generate images from text descriptions alone, as well as algorithms built to understand visual content and express that understanding with words. The challenge with these tasks is for the machines to build semantic connections across different types of datasets (e.g., images, text) and understand the interdependencies between them.

The next step for AI is merging AI technologies like computer vision, speech recognition, and natural language processing to create a multi-modal system.

On this front, Baidu has rolled out a variant of its NLP models that ties together language and visual semantic understanding. Examples of real-world applications for this type of model include digital avatars that can perceive their surroundings like human beings and handle customer support for businesses, and algorithms that can “draw” pieces of art and compose poems based on their understanding of the generated artworks.

There are even more creative, impactful potential outcomes for this technology. The PaddlePaddle platform can build semantic connections across vision and language, which led a group of master’s students in China to create a dictionary to preserve endangered languages in regions like Yunnan and Guangxi by more easily translating them into simplified Chinese.

AI integration across software and hardware, and into industry-specific use cases

As AI systems are applied to solve increasingly complex and industry-specific problems, a greater emphasis is placed on optimizing the software (deep learning framework) and hardware (AI chip) as a whole, instead of optimizing each individually, taking into consideration factors such as computing power, power consumption, and latency.

Further, tremendous innovation is taking place at the platform layer of Baidu’s AI infrastructure, where third-party developers are using the deep learning capabilities to build new applications tailored to specific use cases. The PaddlePaddle platform has a series of APIs to support AI applications in newer technologies such as quantum computing, life sciences, computational fluid mechanics, and molecular dynamics.

AI has practical uses as well. For example, in Shouguang, a small city in Shandong Province, AI is being used to streamline the fruit and vegetable industry. It takes only two people and one app to manage dozens of vegetable sheds.

And this is notable says Wang, “Despite the increased complexity of AI technology, open-source deep learning platform brings together the processor and applications like an operating system, reducing barriers to entry for companies and individuals looking to incorporate AI into their business.”

Reduced barrier to entry for developers and end users

On the technology front, pre-training large models like PCL-BAIDU Wenxin (version ERNIE 3.0 Titan) have solved many common bottlenecks faced by traditional models. For instance, these general-purpose models have helped lay the foundation for running different types of downstream NLP tasks, such as text classification and question-answering, in one consolidated place, whereas in the past, each type of task would have to be solved by a separate model.

PaddlePaddle also has a series of developer-friendly tools, such as model compression technologies to tweak the general-purpose models to fit more specific use cases. The platform provides an officially supported library of industrial-grade models with more than 400 models, ranging from large to small, which retain only a fraction of the general-purpose models’ size but can achieve comparable performance, reducing model development and deployment costs.

Today, Baidu’s open source deep learning technology supports a community of more than four million AI developers who have collectively created 476,000 models, contributing to the AI-driven transformation of 157,000 businesses and institutions. The examples enumerated above are a result of innovations happening across all layers of the Baidu AI infrastructure, which integrates technologies such as voice recognition, computer vision, AR/VR, knowledge graphs, and pre-training large models that are one step closer to perceiving the world like humans.

In its current state, AI has reached a level of maturity that allows it to do amazing tasks. For example, the recent launch of Metaverse XiRang would not have been possible without PaddlePaddle’s platform to create digital avatars for participants around the world to connect from their devices. Further, future breakthroughs in areas like quantum computing could significantly improve the performance of metaverses. This goes to show how Baidu’s different offerings are inter-woven and inter-dependent.

In a few years, AI will be near the core of our human experience. It will be to our society what steam power, electricity, and the internet were to previous generations. As AI becomes more complex, developers like Hongzhi will be working more in the capacity of artists and designers, given the creative freedom to explore use cases previously considered only theoretically possible. The sky is the limit.

This content was produced by Baidu. It was not written by MIT Technology Review’s editorial staff.

Credit: Source link

Previous Post

SPACs, Private Equity, Venture Capital See Jump in Investor Capital

Next Post

Biden is betting $1 billion from taxpayers on a hydrogen startup — Quartz

New York Tech Editorial Team

New York Tech Editorial Team

New York Tech Media is a leading news publication that aims to provide the latest tech news, fintech, AI & robotics, cybersecurity, startups & leaders, venture capital, and much more!

Next Post
Biden is betting $1 billion from taxpayers on a hydrogen startup — Quartz

Biden is betting $1 billion from taxpayers on a hydrogen startup — Quartz

  • Trending
  • Comments
  • Latest
Meet the Top 10 K-Pop Artists Taking Over 2024

Meet the Top 10 K-Pop Artists Taking Over 2024

March 17, 2024
Panther for AWS allows security teams to monitor their AWS infrastructure in real-time

Many businesses lack a formal ransomware plan

March 29, 2022
Zach Mulcahey, 25 | Cover Story | Style Weekly

Zach Mulcahey, 25 | Cover Story | Style Weekly

March 29, 2022
How To Pitch The Investor: Ronen Menipaz, Founder of M51

How To Pitch The Investor: Ronen Menipaz, Founder of M51

March 29, 2022
Japanese Space Industry Startup “Synspective” Raises US $100 Million in Funding

Japanese Space Industry Startup “Synspective” Raises US $100 Million in Funding

March 29, 2022
UK VC fund performance up on last year

VC-backed Aerium develops antibody treatment for Covid-19

March 29, 2022
Startups On Demand: renovai is the Netflix of Online Shopping

Startups On Demand: renovai is the Netflix of Online Shopping

2
Robot Company Offers $200K for Right to Use One Applicant’s Face and Voice ‘Forever’

Robot Company Offers $200K for Right to Use One Applicant’s Face and Voice ‘Forever’

1
Menashe Shani Accessibility High Tech on the low

Revolutionizing Accessibility: The Story of Purple Lens

1

Netgear announces a $1,500 Wi-Fi 6E mesh router

0
These apps let you customize Windows 11 to bring the taskbar back to life

These apps let you customize Windows 11 to bring the taskbar back to life

0
This bipedal robot uses propeller arms to slackline and skateboard

This bipedal robot uses propeller arms to slackline and skateboard

0
Coffee Nova’s $COFFEE Token

Coffee Nova’s $COFFEE Token

May 29, 2025
Money TLV website

BridgerPay to Spotlight Cross-Border Payments Innovation at Money TLV 2025

May 27, 2025
The Future of Software Development: Why Low-Code Is Here to Stay

Building Brand Loyalty Starts With Your Team

May 23, 2025
Tork Media Expands Digital Reach with Acquisition of NewsBlaze and Buzzworthy

Creative Swag Ideas for Hackathons & Launch Parties

May 23, 2025
Tork Media Expands Digital Reach with Acquisition of NewsBlaze and Buzzworthy

Strengthening Cloud Security With Automation

May 22, 2025
How Local IT Services in Anderson Can Boost Your Business Efficiency

Why VPNs Are a Must for Entrepreneurs in Asia

May 22, 2025

Recommended

Coffee Nova’s $COFFEE Token

Coffee Nova’s $COFFEE Token

May 29, 2025
Money TLV website

BridgerPay to Spotlight Cross-Border Payments Innovation at Money TLV 2025

May 27, 2025
The Future of Software Development: Why Low-Code Is Here to Stay

Building Brand Loyalty Starts With Your Team

May 23, 2025
Tork Media Expands Digital Reach with Acquisition of NewsBlaze and Buzzworthy

Creative Swag Ideas for Hackathons & Launch Parties

May 23, 2025

Categories

  • AI & Robotics
  • Benzinga
  • Cybersecurity
  • FinTech
  • New York Tech
  • News
  • Startups & Leaders
  • Venture Capital

Tags

3D bio-printing acoustic AI Allseated B2B marketing Business carbon footprint climate change coding Collaborations Companies To Watch consumer tech crypto cryptocurrency deforestation drones earphones Entrepreneur Fetcherr Finance Fintech food security Investing Investors investorsummit israelitech Leaders LinkedIn Leaders Metaverse news OurCrowd PR Real Estate reforestation software start- up Startups Startups On Demand startuptech Tech Tech leaders technology UAVs Unlimited Robotics VC
  • Contact Us
  • Privacy Policy
  • Terms and conditions

© 2024 All Rights Reserved - New York Tech Media

No Result
View All Result
  • News
  • FinTech
  • AI & Robotics
  • Cybersecurity
  • Startups & Leaders
  • Venture Capital

© 2024 All Rights Reserved - New York Tech Media