Monday, December 11, 2017
Search
  
Submit your own News for
inclusion in our Site.
Click here...
Breaking News
LG Tests LTE-based Safety Technology for Connected, Self-driving Cars
Apple Said To Buy Shazam
Intel Uses Cobalt Interconnect for 10nm, Global Foundries Detail EUV Lithography for 7nm
LG Display Starts OLED Light Panel Production
Toshiba Moving Closer to Deal With Western Digital
YouTube Said to Launch Music Subscription Service
NVIDIA Says New TITAN V GPUs Transform the PC into AI Supercomputer
Toshiba Launches First 14TB HDD with Conventional Magnetic Recording
Active Discussions
Which of these DVD media are the best, most durable?
How to back up a PS2 DL game
Copy a protected DVD?
roxio issues with xp pro
Help make DVDInfoPro better with dvdinfomantis!!!
menu making
Optiarc AD-7260S review
cdrw trouble
 Home > News > General Computing > Microso...
Last 7 Days News : SU MO TU WE TH FR SA All News

Wednesday, June 14, 2017
Microsoft AI Masters Pac-Man


To master the game Ms. Pac-Man, Microsoft researchers have created an artificial intelligence-based system that learned how to get the maximum score on the legendary video game Ms. Pac-Man, using a divide-and-conquer method that could have broad implications for teaching AI agents to do complex tasks that augment human capabilities.

The team from Maluuba, a Canadian deep learning startup acquired by Microsoft earlier this year, used a branch of AI called reinforcement learning to play the Atari 2600 version of Ms. Pac-Man perfectly. Using that method, the team achieved the maximum score possible of 999,990.

To get the high score, the team divided the large problem of mastering Ms. Pac-Man into small pieces, which they then distributed among AI agents. That's similar to some theories of how the brain works, and it could have broad implications for teaching AIs to do complex tasks with limited information.

The method, which the Maluuba team calls Hybrid Reward Architecture, used more than 150 agents, each of which worked in parallel with the other agents to master Ms. Pac-Man. For example, some agents got rewarded for successfully finding one specific pellet, while others were tasked with staying out of the way of ghosts.

Then, the researchers created a top agent who took suggestions from all the agents and used them to decide where to move Ms. Pac-Man. The top agent took into account how many agents advocated for going in a certain direction, but it also looked at the intensity with which they wanted to make that move. For example, if 100 agents wanted to go right because that was the best path to their pellet, but three wanted to go left because there was a deadly ghost to the right, it would give more weight to the ones who had noticed the ghost and go left.

Harm Van Seijen, a research manager with Maluuba who is the lead author of a new paper about the achievement, said the best results were achieved when each agent acted very egotistically - for example, focused only on the best way to get to its pellet - while the top agent decided how to use the information from each agent to make the best move for everyone.

Rahul Mehrotra, a program manager at Maluuba, said figuring out how to win these types of videogames is actually quite complex, because of the huge variety of situations you can encounter while playing the game.

With reinforcement learning, an agent gets positive or negative responses for each action it tries, and learns through trial and error to maximize the positive responses, or rewards.

An AI-based system that uses supervised learning would learn how to come up with a proper response in a conversation by feeding it examples of good and bad responses. A reinforcement learning system, on the other hand, would be expected to learn appropriate responses from only high-level feedback, such as a person saying she enjoyed the conversation - a much more difficult task.

AI experts believe reinforcement learning could be used to create AI agents that can make more decisions on their own, allowing them to do more complex work and freeing up people for even more high-value work.



Previous
Next
Pioneer's Flagship Se-Monitor5 Hi-Res Headphones Released        All News        Nokia Unveils the World's Fastest Routers
Western Digital's SanDisk Subsidiaries Seek Injunctive Relief Against Toshiba in the Superior Court of California     General Computing News      Nokia Unveils the World's Fastest Routers

Get RSS feed Easy Print E-Mail this Message

Related News
IBM Says New POWER9-based AC922 Power Systems Offer 4x Deep-learning Framework Performance Over x86
IBM Scientists Demonstrate 10x Faster Machine Learning Using GPUs
Microsoft Plans Expansion of Redmond Campus
Microsoft Expands Deal With SAP to Use and Sell More of Each Other's Cloud Services
Top Black Friday deals from Microsoft
Microsoft Cloud Continues to Grow, Powers First Quarter Results
Intel Advances Artificial Intelligence With Nervana Neural Network Processor
AMD, Intel, ARM, IBM and Others Support the Open Neural Network Exchange Format for AI
Microsoft to Buy Wind Energy From GE's new Wind Farm in Ireland
Microsoft Brings the Edge Browser to iOS and Android
Intel's New Loihi Self-Learning Chip Promises to Accelerate Artificial Intelligence
Microsoft Announces First Windows S Devices, Brings cloud, AI and Mixed Reality to Businesses

Most Popular News
 
Home | News | All News | Reviews | Articles | Guides | Download | Expert Area | Forum | Site Info
Site best viewed at 1024x768+ - CDRINFO.COM 1998-2017 - All rights reserved -
Privacy policy - Contact Us .