FY2016 Annual Report

Neural Computation Unit
Professor Kenji Doya Doya Unit Photo

Abstract

The Neural Computation Unit pursues the dual goals of developing robust and flexible learning algorithms and elucidating the brain’s mechanisms for robust and flexible learning. Our specific focus is on how the brain realizes reinforcement learning, in which an agent, biological or artificial, learns novel behaviors in uncertain environments by exploration and reward feedback. We combine top-down, computational approaches and bottom-up, neurobiological approaches to achieve these goals.

In FY2016, we obtained two major external grants: Kakenhi project on Artificial Intelligence and Brain Science and Post-K Supercomputing project on Brain and Artificial Intelligence.

In the Kakenhi project on Artificial Intelligence and Brain Science, Prof. Doya serves as the project leader to orchestrate research in 11 laboratories across the country, from both AI and neuroscience fields.

Prof. Doya also leads the Post-K supercomputing project on Brain and Artificial Intellignece, a joint effort of seven groups working on neural data analysis, neural network modeling, and brain-inspired AI applications to utilize Japan's next flagship supercomputer, about 100 times performance of the present K supercomputer, to launch in 2020.

Our unit also participated in Japan's major brain science project, Brain/MINDS, in developing neural data analysis pipelines and modeling methodologies.

1. Staff

Dynamical Systems Group

Ildefons Magrans de Abril, Staff Scientist
Carlos Enrique Gutierrez, Postdoctoral Scholar
Hiromichi Tsukada, Postdoctoral Scholar
Jessica Verena Schulze, OIST Student
Kosuke Yoshida, Special Research Student
Junichiro Yoshimoto, Visiting Researcher (NAIST)

Systems Neurobiology Group

Makoto Ito, Group Leader
Akihiro Funamizu, Postdoctoral Scholar
Kazumi Kasahara, JSPS Research Fellow
Katsuhiko Miyazaki, Staff Scientist
Kayoko Miyazaki, Staff Scientist
Hiroaki Hamada, OIST Student
Tomohiko Yoshizawa, Special Research Student

Adaptive Systems Group

Qiong Huang, OIST Student
Farzana Rahman, OIST Student
Chris Reinke, OIST Student
Jiexin Wang, Special Research Student
Paavo Parmas, OIST Student
Tadashi Kozuno, OIST Student
Eiji Uchibe, Visiting researcher (ATR)

Administrative Assistant / Secretary

Emiko Asato
Kikuko Matsuo

2. Collaborations

Development of reinforcement learning technology for large scale systems and its application to ICT systems

Type of collaboration: Joint research
Researchers:
- Seishi Okamoto, Fujitsu Laboratories Ltd.
- Naoki Sashida, Fujitsu Laboratories Ltd.
- Tomotake Sasaki, Fujitsu Laboratories Ltd.

3. Activities and Findings

3.1 Kakenhi Project on Artificial Intelligence and Brain Science

1) In applying deep neural networks to reinforcement learning, standard methods like Deep Q-Network (DQN) take a method to assure stability at the compromise of data-efficiency. We generalized the method in the framework of "approximate value iteration" and mathematically analyzed the speed of convergence. As a result, we derived a more data-efficient algorithm and verified its performance in benchmark tasks.

2) In order to elucidate the information coding in the basal ganglia for reinforcement learning, we performed cell-type specific optical recording of neurons in the striosome compartment of the striatum. In an odor conditioning task, we discovered that striosome neurons acquire reward predictive activities, supporting the hypothesis that they encode state value functions in reinforcement learning.

3) To understand the mechanism of predictive coding in the cerebral cortex, we started to develop a novel behavioral paradigm for mice and calcium imaging system using a prism lens for simultaneously recording neurons in multiple cortical layers.

Striosome Imaging

Figure 1: Endoscopic calcium imaging of striosome neurons.

3.2 Post-K Supercomputing Project on Brain and Artificial Intelligence

We constructed an “integrate-and-fire” neural network model of the basal ganglia circuit based on the mean-field approximation model by Lineard & Girarad (2014), which was based on the anatomical and physiological data of the basal ganglia. The model was coded by a network description language PyNEST and scaled as 1/20,000 of the macaque monkey brain, with 2,644 neurons in the input site, the striatum.

In parallel simulation by NEST, we succeeded in reproducing the firing rates of each region in normal and pharmacologically manipulatesstates. Furthermore, by taking this network as a single “channel” and laterally connecting multiple channels, we observed that only a single channel produced its output for a given input, which was a realization of action selection function.

Girard 2016 model

Figure 2: Spiking neural network model of the basal ganglia.

3.3 Brain/MINDS Project on Multi-scale Neural Modeling

The aim of this research and development is to maximally utilize the data obtained in the Brain/MIDNS program at macroscopic, mesoscopic and microscopic levels for building computational models of the brain and to integrate the models at different levels for understanding and predictions of cognitive and behavioral functions starting from the molecular and cellular levels through the circuit mechanisms. We performed the following research and developments:

1) Automatic estimation of model parameters: We explored the Bayesian connectivity inference framework to integrate the connections estimated by fiber-tracking using diffusion MRI data as the prior probability and the resting-state functional MRI data for the likelihood and ran benchmarks of multiple methods. We also produced data processing pipeline to apply those methods for the marmoset MRI data obtained at RIKEN.

2) Multi-scale data integration: Toward the goal of evaluation and optimization of fiber-tracking algorithms for diffusion MRI data using the neural tracer data as the reference, we built trace image processing pipeline in close collaboration with researchers at RIKEN and Kyoto University. We also performed evaluation of fiber-tracking algorithms.

3) Model building and performance validation: We built a whole-brain network model based on the connectivity matrix estimated by fiber tracking based on the marmoset diffusion MRI data and analyzed how the model could reproduce the functional connectivity given by resting-state functional MRI. We clarified key parameters that affect the network behaviors.

DTI-fMRI integration

Figure 3: Simulation of network model based on anatomical connectivity data from diffusion MRI and its comparison with functional connectivity data from resting-state functional MRI.

3.4 Other Research Developments

We published papers from our previous Kakenhi project on Prediction and Decision Making, regarding the neural mechanisms of mental simulation (Fermin et al., Scientific Reports, 2016; Funamizu et al., Nature Neuroscience, 2016).

We also published papers on efficient reinforcement learning algorithms (Elfwing et al., 2016; Wang et al., 2017).

4. Publications

4.1 Journals

Elfwing, S., Uchibe, E., & Doya, K. (2016). From free energy to expected energy: Improving energy-based value function approximation in reinforcement learning. Neural Networks, 84, 17-27. doi:http://dx.doi.org/10.1016/j.neunet.2016.07.013
Fermin, A. S. R., Yoshida, T., Yoshimoto, J., Ito, M., Tanaka, S. C., & Doya, K. (2016). Model-based action planning involves cortico-cerebellar and basal ganglia networks. Nature Scientific Reports, 6(31378). doi:10.1038/srep31378
Funamizu, A., Kuhn, B., & Doya, K. (2016). Neural substrate of dynamic Bayesian inference in the cerebral cortex. Nature Neuroscience. doi:http://dx.doi.org/10.1038/nn.4390
Okamoto, Y., Okada, G., Tanaka, S., Miyazaki, K., Miyazaki, K., Doya, K., & Yamawaki, S. (2016). The role of serotonin in waiting for future rewards in depression. International Journal of Neuropsychopharmacology, 19, 33-33.
Shimizu, Y., Doya, K., Okada, G., Okamoto, Y., Takamura, M., Yamawaki, S., & Yoshimoto, J. (2016). Depression severity and related characteristics correlate significantly with activation in brain areas selected through machine learning. International Journal of Neuropsychopharmacology, 19, 135-136.
Shouno, O., Tachibana, Y., Nambu, A., & Doya, K. (2017). Computational model of recurrent subthalamo-pallidal circuit for generation of parkinsonian oscillations. Frontiers in Neuroanatomy, 11. doi:http://dx.doi.org/10.3389/fnana.2017.00021
Wang, J. X., Uchibe, E., & Doya, K. (2017). Adaptive baseline enhances EM-based policy search: Validation in a view-based positioning task of a smartphone balancer. Frontiers in Neurorobotics, 11. doi:http://dx.doi.org/10.3389/fnbot.2017.00001
Yoshida, K., Yoshimoto, J., & Doya, K. (2017). Sparse kernel canonical correlation analysis for discovery of nonlinear interactions in high-dimensional data. BMC Bioinformatics, 18(1), 108. doi:http://dx.doi.org/10.1186/s12859-017-1543-x

4.2 Books and other one-time publications

Doya, K. (2016). Reinforcement learning as a model of intelligence. In T. Makino, T. Shibuya, & S. Shirakawa (Eds.), Reinforcement Learning From Now On (pp. 284-294). Tokyo: Morikita Publishing Co., Ltd.

4.3 Oral and Poster Presentations

Doya, K. (2016.09.08-10). Artificial Intelligence, Brain Science and Human Mind, Waters Edge, Colombo, Sri Lanka.
Doya, K. (2016). Brain Science and Artificial Intelligence: From Basic Science to Innovation. Naha, Okinawa: Okinawa Open Days 2016（OOD2016）.
Doya, K. (2016.11.22). Imaging the neural circuits for mental simulation and reward prediction, University of Toyama.
Doya, K. (2016.06.21). Introduction to reinforcement learning and Bayesian inference, OIST Seaside House, Onna-son, Okinawa.
Doya, K. (2016.9.26-27). Neural Circuit Mechanisms of Mental Simulation, Hebrew University of Jerusalem, Israel.
Doya, K. (2017.03.29). Coding of action and state values in the striatal compartments, Merida, Mexico.
Doya, K. (2017). From Circuit Architectures to Learning Mechanisms of the Cerebellum, the Basal Ganglia, and the Cerebral Cortex. Yokohama, Kanagawa: The64thJSAP Spring Meeting, 2017.
Doya, K., Elfwing, S., & Uchibe, E. (2016.10.08). Design, inference and evolution of reward functions for robots, University of London, London.
Doya, K., Miyazaki, K. W., & Miyazaki, K. (2016.09.28-30). Serotonin and the regulation of patience, Gatsby Computtional Neuroscience Unit, London, England.
Funamizu, A. (2016). Neural substrate of dynamic Bayesian inference in posterior parietal cortex. OIST, Okinawa, Japan.
Funamizu, A. (2016). Neural substrate of dynamic Bayesian inference in posterior parietal cortex. Pacifico Yokohama, Yokohama, Kanagawa, Japan.
Funamizu, A. (2016). Neural substrate of dynamic Bayesian inference in the cerebral cortex. New York University, New York, US.
Funamizu, A., Kuhn, B., & Doya, K. (2016.07.20). Neural substrate of dynamic Bayesian inference in posterior parietal cortex, Pacifico Yokohama, Yokohama, Kanagawa, Japan.
Gutierrez, C. E., Yoshimoto, J., & Doya, K. (2016). Community Detection and Mean Field Approximation for Dimension Reduction of Spiking Network Models. Suzuki Umetaro Hall RIKEN, Wako, Saitama, Japan: Advances in Neuroinformatics (AINI) 2016.
Hamada, H., Hikishima, K., Takata, N., Sakai, Y., Tanaka, K., & Doya, K. (2016). The difference of resting-state brain activities in awake and anesthetized states in mice. University of Vienna,Vienna, Austria: 5th Biennial Conference on Resting State and Brain Connectivity 2016.
Hamada, H., Sakai, Y., Takata, N., Hikishima, K., Tanaka, K., & Doya, K. (2016). Mapping Functional Whole-Brain Networks in an Awake State of Mice. Pacifico Yokohama, Yokohama, Kanagawa, Japan: The 39th Annual Meeting of the Japanese Neuroscience Society.
Ito, M. (2016). Extraction of brain macrostate and estimation of algorism by sparse modeling. Takeda Hall, Tokyo university, Tokyo, Japan: 4th research meeting of intiative for high-dimensional data-driven science through deepening of sparse modeling.
Ito, M. (2016). Hippcampal CA1 recording and macro-state extraction. Keio University, Tokyo: Public symposium of Sparse Modeling.
Ito, M., & Doya, K. (2016.07.22). Nonlinear dimension reduction of optically recorded activities of hippocampus CA1 pyramidal neurons revealed not only spatial but also motion information coding, Pacifico Yokohama, Yokohama, Kanagawa, Japan.
Ito, M., & Doya, K. (2016.11.13). A nonlinear unsupervised-learning method extracts spatial information and more from hippocampal population activity of freely-moving rats San Diego, CA, USA.
Ito, M., & Yoshizawa, T. (2016). Endoscope calcium imaging of rat hippocampus and mouse striatum by miniature microscope. 8th Optogenetics symposium. Mita campus, Keiou-Univ, Tokyo, Japan.
Magrans de Abril, I., Yoshimoto, J., & Doya, K. (2016). A strategy to infer hidden sources behind multiple neural recording data for neural circuit inference. Suzuki Umetaro Hall RIKEN, Wako, Saitama, Japan: 4th INCF Japan Node International Workshop Advances in Neuroinformatics 2016 and 14th INCF Nodes Workshop.
Qiong, H., Uchibe, E., & Doya, K. (2016). Emergence of communication among reinforcement learning agents under coordination environment. Cergy-Pontoise, Paris, France: IEEE ICDL-EPIROB 2016.
Reinke, C. (2016). Brain inspired temporal decision making algorithms & Research opportunities at the Okinawa Institute of Science and Technology. Frankfurt Institute for Advanced Studies.
Schulze, J. V. (2016). Functionally informed priors in a bayesian machine lerning approach to neural connectivity inference. the Universidad Catolica San Pablo, Arequipa, Peru: Machine learning summer school 2016
Schulze, J. V. (2016). Functionally Infotmend Priors in a Bayesian Machine Learning Approach to Neural Connectivtity Inference. Barcelona, Spain: WiML & NIPS.
Tsukada, H., Hamada, H., Nakae, K., Ishii, S., Hata, J., Okano, H., & Doya, K. (2016). Mathematical modeling and dynamical analysis using structural and functional connectivity. Suzuki Umetaro Hall RIKEN, Wako, Saitama, Japan: 4th INCF Japan Node International Workshop, Advances in Neuroinformatics 2016 and 14th INCF Nodes Workshop.
Tsukada, H., Hamada, H., Nakae, K., Ishii, S., Hata, J., Okano, H., & Doya, K. (2016.08.31). A mathematical modeling approach for structural and functional connectivity in MRI, OIST Seaside House, Onna-son,Okinawa, Japan.
Tsukada, H., Hamada, H., Nakae, K., Ishii, S., Hata, J., Okano, H., & Doya, K. (2017). Dynamical analysis and mathematical modeling using structural and functional connectivity data. OIST Seaside House,Onna, Okinawa, Japan.
Tsukada, H., Hamada, H., Nakae, K., Ishii, S., Hata, J., Okano, H., & Doya, K. (2017). Mathematical model using structural and functional connectivity data of marmoset. Hokkaido university, Sapporo, Hokkaido, Japan.
Tsukada, H., Hamada, H., Nakae, K., Ishii, S., Hata, J., Okano, H., & Doya, K. (2017). Whole brain model simulation based on diffusion and functional MRI data. IINO HALL, Chiyoda-KU, Tokyo: The second symposium of brain and mind.
Uchibe, E. (2016). Deep inverse reinforcement learning based on KL-control.Special research committee on embodiment cognitive science and real world applications (ECSRA). Graduate School of Engineering Science, Osaka University, Osaka, Japan.
Yoshimoto, J., Kannon, T., Amano, M., Nishioka, T., Usui, S., & Kaibuchi, K. (2016). KANPHOS Platform: A database for neural phosphoproteomics with quality control. Suzuki Umetaro Hall RIKEN, Wako, Saitama, Japan: The 4th INCF Japan Node International Workshop Advances in Neuroinformatics (AINI 2016).
Yoshizawa, T., Ito, M., & Doya, K. (2016). The activities of striatal patch neurons in classical conditioning. Rusutsu, Hokkaido, Japan: The 16th Winter Workshop on the Mechanism of Brain and Mind.
Yoshizawa, T., Ito, M., & Doya, K. (2016). The role of striatal patch neurons in reward-based learning. Pacifico Yokohama, Yokohama, Kanagawa, Japan: The 39th Annual Meeting of the Japan Neuroscience Society.
Yoshizawa, T., Ito, M., & Doya, K. (2016). The striatal striosome compartment encodes the value of sensory stimulus. San Diego Convention Center, San Diego, CA, USA: Neuroscience 2016.

5. Intellectual Property Rights and Other Specific Achievements

Nothing to report

6. Meetings and Events

6.1 Seminars

Computational model-based analysis of learning and memory: stress, genes and prediction

Date: May 11, 2016
Venue: OIST Campus Lab1
Speaker: Dr. Gediminas Luksys (University of Basel)

Learning word meaning and associations from co-occurrence data

Date: September 23, 2016
Venue: OIST Campus Lab1
Speaker: Dr. Stefan Evert (Friedrich-Alexander-Universität Erlangen-Nürnberg)

Is depression caused by a hyperactive habenula?

Date: November 29, 2016
Venue: OIST Campus Lab1
Speaker: Dr. Jonathan Roiser (UCL Institute of Cognitive Neuroscience (ICN))

Action Selection & Reinforcement Learning in animals & robots

Date: December 5, 2016
Venue: OIST Campus Lab1
Speaker: Dr. Benoit Girard (ISIR, UPMC/CNR)

Saccadic eye movements

Date: December 6, 2016
Venue: OIST Campus Lab1
Speaker: Dr. Benoit Girard (ISIR, UPMC/CNR)

Navigation strategies & Multiple learning systems

Date: December 7, 2016
Venue: OIST Campus Lab1
Speaker: Dr. Benoit Girard (ISIR, UPMC/CNR)

6.2 Events

Joint Workshop: Neuro-Computing, Bioinformatics, Mathematical modeling and Machine Learning

Date: July 4-6, 2016
Venue: OIST Capmpus, Semiar Room B250 & Seminar Room C210
Co-organizers:
- The Institute of ElectronicsInformation and Communication Engineers(IEICE)
- Information Processing Society of Japan
- IEEE Computational Intelligence Society Japan Chapter
- Japan Neural Network Society
Speaker:
- Dr. Yutaka Hirata (Chubu University)

Sponsored Research Meeting: "Comparison and Fusion of Artificial Intelligence and Brain Science" and "Next Generation of Kei Super Computer" joint workshop

Date: August 11-12, 2016
Venue: Shonan Village Center
Organizers:
- Grant-in Aid for Scientific Research on Innovative Areas, MEXT, JAPAN, Correspondence and Fusion of Artificial Intelligence and Brain Science
- Post K Exploratory Challenge, Big Brain-Data Analysis, Whole Brain-Scale Simulation, and Brain -Style AI Architecture
Speaker:
- Dr. Kenji Doya (OIST)
- Dr. Shin Ishii (Kyoto University)
- Dr. Masashi Sugiyama (University of Tokyo)
- Dr. Yutaka Matsuo (University of Tokyo)
- Dr. Masamichi Sakagami (Tamagawa University)
- Dr. Jun Igarashi (RIKEN)
- Dr. Tadashi Yamazaki (University of Electro-Communications)
- Dr. Koichi Takahashi (RIKEN)
- Dr. Hiroshi Yamakawa (Dowango)
- Dr. Tatsuya Harada (University of Tokyo)

Sponsored Research Meeting: The 1st Research Area Meeting Scientific Research on Innovative Areas: Artificial Intelligence and Brain Science

Date: December 21, 2016
Venue: National Center of Science
Organizer: Grant-in Aid for Scientific Research on Innovative Areas, MEXT, JAPAN, Correspondence and Fusion of Artificial Intelligence and Brain Science