New 'bandit' algorithm uses light for better bets

August 21, 2023

How does a gambler maximize winnings from a row of slot machines? This is the inspiration for the "multi-armed bandit problem," a common task in reinforcement learning in which "agents" make choices to earn rewards. Recently, an international research team led by Hiroaki Shinkawa at the University of Tokyo developed an extended photonic reinforcement learning scheme that moves from the static bandit problem towards a more challenging dynamic environment. This study was published in Intelligent Computing.

from Tech Xplore - electronic gadgets, technology advances and research news https://ift.tt/jRZI7Gl

Search This Blog

News for All

New 'bandit' algorithm uses light for better bets

Comments

Post a Comment

Popular posts from this blog

Space-based experiments show wax-filled heat sinks keep electronics cooler for longer

AI designs new underwater gliders with shapes inspired by marine animals

Shane Warne's body prepared for autopsy ahead of repatriation