

# ADVANCING NEUROMORPHIC COMPUTING FROM PROMISE TO COMPETITIVE TECHNOLOGY

Mike Davies Director, Neuromorphic Computing Lab | Intel Labs

March 27, 2019 Neuro-Inspired Computational Elements Workshop

# **LEGAL INFORMATION**

This presentation contains the general insights and opinions of Intel Corporation ("Intel"). The information in this presentation is provided for information only and is not to be relied upon for any other purpose than educational. Intel makes no representations or warranties regarding the accuracy or completeness of the information in this presentation. Intel accepts no duty to update this presentation based on more current information. Intel is not liable for any damages, direct or indirect, consequential or otherwise, that may arise, directly or indirectly, from the use or misuse of the information in this presentation.

Intel technologies' features and benefits depend on system configuration and may require enabled hardware, software or service activation. Learn more at intel.com, or from the OEM or retailer.

No computer system can be absolutely secure. No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by this document. Intel, the Intel logo, Movidius, Core, and Xeon are trademarks of Intel Corporation in the United States and other countries.

\*Other names and brands may be claimed as the property of others

Copyright © 2019 Intel Corporation.

# Neuromorphic Computing Exploration Space



### Research Goals:

- **Broad class** of brain-inspired computation
- Efficient hardware implementations
- **Scalable** from small to large problems and systems

### Examples:

- Online and lifelong learning
- Learning without cloud assistance
- Learning with sparse supervision
- Understanding spatiotemporal data
- Probabilistic inference and learning
- Sparse coding/optimization
- Nonlinear adaptive control (robotics)
- Pattern matching with high occlusion
- SLAM and path planning
- Dynamical systems modeling

3

# Some Principles of Neural Computation



# Why Spikes? Findings from our research

- 1) Sparse communication in time optimizes energy efficiency (bits/J vs bits/s)
- 2) Spikes efficiently compute many rate-based models
- 3) Spikes provide efficient and natural processing of temporal data
- 4) Spikes support event-based algorithms that have nothing to do with rates
- 5) Spikes (surprisingly) efficiently implement **phasor networks**

In all examples studied so far, benefits vs conventional architectures increase with increasing problem scale



# **OUR LOIHI RESEARCH CHIP**



### **KEY PROPERTIES**

- 128 neuromorphic cores supporting up to 128k neurons and 128M synapses with an **advanced spiking neural network feature set**.
- Supports highly complex neural network topologies
- **Scalable on-chip learning** capabilities to support an unprecedented range of learning algorithms
- Fully digital **asynchronous** implementation
- Fabricated in Intel's 14nm FinFET process technology



#### Integrated Memory + Compute Neuromorphic Architecture

Davies et al, "Loihi: A Neuromorphic Manycore Processor with On-Chip Learning." IEEE Micro, Jan/Feb 2018.

# Mesh Operation: Fine-Grained Synchronization



Time step T begins.

Cores update dynamic neuron state and evaluate firing thresholds



Above-threshold neurons send spike messages to fanout cores

(Two neuron firings shown.)



All neurons that fire in time T route their spike messages to all destination cores.





# Learning with Synaptic Plasticity

- Local learning rules essential property for efficient scalability
- Rules derived by optimizing an emergent statistical objective
- Plasticity on wide range of time scales for
  - ✓ Immediate supervised (labelled) learning
  - ✓ Unsupervised self-organization
  - ✓ Working memory
  - ✓ Reinforcement-based delayed feedback



Learning rules for weight  $W_{x,y}$  may *only* access presynaptic state x and postsynaptic state y

**Reward spikes** may be used to distribute graded reward/punishment values to a particular set of axon fanouts



# Loihi's Trace-Based Programmable Learning



# Loihi Systems

Q4 2017 Wolf Mountain Remote Access 4 Loihi/Board

## Q2 2018

Nahuku Arria10 Expansion Board For cloud & local use 8-32 Loihi/Board

### Q3 2018 Kapoho Bay 1-2 Loihi DVS interface USB host interface

### Q2 2019 Pohoiki Springs Remote Access Up to 768 chips (100M neurons)











## Nx SDK Software Architecture



# INTEL NEUROMORPHIC RESEARCH COMMUNITY Collaborating to Accelerate Progress



44+ active projects, 50+ organizations Iceland Workshop (Sep 28 – Oct 2) attended by 62 researchers Winter Workshop (Feb 11-15) attended by 90+ researchers

# **INRC** Winter Workshop Attendance



# JOIN THE COMMUNITY E-mail: inrc\_interest@intel.com



# **SNN Algorithms Discovery and Development**





# **DNN-to-SNN conversion for keyword spotting**



- Loihi provides 5-10x lower energy than closest conventional DNN architecture
- Caveats: batchsize=1 and reduced accuracy (90.6% SNN vs 92.7% DNN)

Results from: Blouw et al, "Benchmarking Keyword Spotting Efficiency on Neuromorphic Hardware." arXiv:1812.01739



# Case Study: LASSO Sparse Coding

## The Spiking Locally Competitive Algorithm (S-LCA)



## Neural Network Structure

Inhibition



# Spiking LCA dynamics on Loihi



18

# Loihi compared to Core i7 CPU



\* Intel Core i7-4790 3.6GHz w/ 32GB RAM. FISTA solver: SPAMS http://spams-devel.gforge.inria.fr/ Performance results are based on testing as of December 2018 and may not reflect all publicly available security updates. No product can be absolutely secure.

# Loihi compared to Core i7 CPU (smaller problems)

### **CPU/Loihi Ratios**



\* Intel Core i7-4790 3.6GHz w/ 32GB RAM. FISTA solver: SPAMS http://spams-devel.gforge.inria.fr/ Performance results are based on testing as of December 2018 and may not reflect all publicly available security updates. No product can be absolutely secure.

20



# Next Steps: Generalizations & Learning

Unsupervised dictionary learning:

Lin, Tsung-Han, and Ping Tak Peter Tang. 2018. "Dictionary Learning by Dynamical Neural Networks." arXiv preprint. <u>https://arxiv.org/abs/1805.08952</u>.

Yijing Watkins and Garret Kenyon – upcoming NICE talk & poster

Generalization to data manifold learning:

Pehlevan, Cengiz. 2019. "A Spiking Neural Network with Local Learning Rules Derived From Nonnegative Similarity Matching." arXiv preprint. <u>https://arxiv.org/abs/1902.01429</u>.

Hierarchical LCA for adversarial-robust inference:

Jacob M Springer, et al. "Classifiers Based on Deep Sparse Coding Architectures are Robust to Deep Learning Transferable Examples." arXiv preprint. <u>https://arxiv.org/abs/1811.07211</u>



# Spike-based LSTMs – "LSNNs"

## Simple adaptive spiking model achieves LSTM-level accuracy

- SNN reservoir augmented with adaptive neurons ۲
- Thresholds rise on each spike, decay exponentially ۲ Highly energy-efficient adaptation
- Trained offline with BPTT (TensorFlow) ۲
- Achieves 96% accuracy on sequential ٠ MNIST, same as equivalent LSTMs
- **Runs on Loihi today with 94% accuracy** ۲

[Bellec et al, arXiv preprint arXiv:1803.09574]



23

# "Neuromorphic Backpropagation"

### Numerous promising approaches:

- Eligibility Propagation Bellec, et al (TU Graz), on arxiv Jan 25, 2019.
- Surrogate Gradient Learning Mostafa, Neftci, Zenke (Tue/Wed), on arxiv Jan 28, 2019.
- Dendritic cortical microcircuits approximate the backpropagation algorithm J Sacramento, et al. NeurIPS 2018.

Soon we will be able to train **multi-layer** and **recurrent LSNNs** with local threefactor learning rules on Loihi.



[Bellec et al, arXiv preprint arXiv:1901.09049]

## intel

# Adaptive Control of a Robot Arm Using Loihi

SNN adaptive dynamic controller implemented on Loihi allows a robot arm to adjust in real time to nonlinear, unpredictable changes in system mechanics<sup>[1][2]</sup>.

Result outperforms standard PD & PID control algorithms.





Different control methods adapting to a gradual, linear increase in friction, over the course of 50 runs. This simulates ~3 years of wear over the course of 16.67 minutes of run time, a 90K times speed up. Only 20K neurons on Loihi is able to successfully cope with this perturbation.

[1] DeWolf, T., Stewart, T. C., Slotine, J. J., & Eliasmith, C. (2016, November). A spiking neural model of adaptive arm control. In *Proc. R. Soc. B* (Vol. 283, No. 1843, p. 20162134). The Royal Society.

[2] Eliasmith, "Building applications with next generation neuromorphic hardware." *NICE Workshop 2018* 



# Solving Constraint Satisfaction Problems

**SNN with noise** stochastically searches to find the minimum energy solution:

3.m





WIP: Self-checking validation network to stop execution when solutions are found.



# Graph Search – Path Planning

Runtime comparison to best Djikstra optimizations:

- Neuromorphic:  $O(L \cdot \sqrt{V})$
- Standard: O(E)

For most nontrivial problems:

- L<<E
- V<<E

Neuromorphic solution uses fine-grain parallelism an temporal wavefront-driven computation to potentially provide great performance gains for large problems.

## **Robot Motion**



### Loihi Representation



Based on Ponulak F., Hopfield J.J. Rapid, parallel path planning by propagating wavefronts of spiking neural activity. Front. Comput. Neurosci. 2013. V. 7. Article № e98. DARPA SDR Site B (Data from Radish Robotics Dataset)

# Graph Search on Nahuku (32-chip Loihi System)

## Increasing core parallelism with fixed chip count



#### **Execution Time per Timestep**

2

# Searching Small World Networks with Loihi

Watts-Strogatz network model with rewiring probability 20%.



Runtime for 100,000 nodes

**Runtime for 10 edges per node** 

\* Intel Xeon 6136 3.00 GHz w/ 32GB RAM.

\*\* with <u>NetworkX</u> graph analytics library

Performance results are based on testing as of December 2018 and may not reflect all publicly available security updates. No product can be absolutely secure.



# **Olfaction-Inspired One Shot Learning**

Sensory Neurons

Olfactory System Olfactory Bulb Olfactory Cortex Olfactory Cortex

### Spatiotemporal Attractor Model



# **Outperforms Conventional Algorithms**

## Provides average of **8% accuracy improvement** vs deep autoencoder

**40x more data efficient** learning vs backpropagation

Supports **online learning** (robust to catastrophic forgetting)



### **Classification Accuracy**

## **Excellent Scaling to Larger Network Sizes**



Performance results are based on testing as of December 2018 and may not reflect all publicly available security updates. No product can be absolutely secure.

# **Phasor Neural Networks**

## An emerging paradigm for SNN computation?

Idea: Represent neural activities with complex numbers

Offer benefits for associative memory capacity, backprop gradient propagation, VSA factoring, among others.

### Many SNN implementation benefits:

- Simple LIF implementation w/ different E/I decays
- Constant guaranteed sparse activity
- Synaptic delays provide non-trivial phase transformations
- Fast, bounded response time vs rate coding

Sparse SNN phasor generalization of Hopfield network provides up to **6x higher information per synapse** vs real-valued Hopfield network.

EP Frady, F Sommer, "Robust computation with rhythmic spike patterns." arXiv:1901.07718



# **The Frontier Ahead**

Advancing from Compelling Example Results to Valuable Real-World Technologies

Adaptive

• Inference and learning of sparse feature representations

Low Latency

- Video and speech recognition
- Event-based camera processing
- Chemosensing

Low Energy

- Adaptive dynamic control
- Anomaly detection for security and industrial monitoring
- Optimization: Constraint Satisfaction, QUBO, Convex optimization
- Autonomy: SLAM, Planning, closedloop behavior

Batch Size = 1



High Cost

## Thank You!



Email inrc\_interest @ intel.com for more information