Introduction to statistical models of neural spike train data
Lectuler: Hideaki Shimazaki; RIKEN Brain Science Institute, Email: shimazaki at brain.riken.jp
Course descreption:
This course will introduce statistical models of event data, i.e., Point process models, and their applications to the analysis of neural spike trains. The models include homogeneous and inhomogeneous (rate-modulated) Poisson models and various history-dependent non-Poisson models (e.g., a renewal model). We will also study a popular parametric model in the field known as the point process - generalized linear model (GLM), which offers a tractable scheme to include stimulus input or motor output signals in the model. The course then covers statistical inference based on these models. Neurophysiologists often relate stimulus or behavior with spike-rates of individual neurons obtained by repeated measurements. We will review non-parametric estimation of the time-varying spike-rate using a classical histogram or kernel smoother, and optimization of these methods under the assumption of a Poisson point process. Methods for parametric models (e.g., point process-GLM) will cover standard procedures such as maximum likelihood estimation and a test for the goodness-of-fit. Emphasis is on validation of the models. Students will be exposed to important conceptions such as model selection, resampling, and statistical tests. We will review how these statistical models are used to express a set of assumptions to test specific features of neural activity relevant for information processing in neural coding studies. Toward the end of the course, students will learn how to decode a stimulus input or motor output from neural activity. The course will introduce a state-space model of the point process-GLM, and a recursive Bayesian filter/smoother to decode these signals. Practical applications of the decoding methods in Neuroscience and neuroprosthetic studies will be reviewed.
In this course, you will learn:
point process theory, time-rescaling theorem, non-parametric density estimation, state-space model, model selection, smooth prior, Bayesian recursive filter, generalized linear model, maximum entropy model, higher-order ineteraction, introductory information geometry, Fisher information, Laplace approximation, expectation-maximization algorithm.
, maximum likelihood estimation, L2/L1 regularization
Prerequisite: The course is designed for stutdents with no previous experience with statistical analysis and neural data. A basic knowledge of calculus and statistics would help.
Lecture 0: Overview of the course, neural coding studies
The first lecture will introduce historical aspect of neural coding studies. Sensory systems.
You will learn why the statistical modeling studies become important.
You will learn:
Lecture 1: Neural spike data and Point processes
Poisson point process, memoryless property, exponential distribution,
conditional intensity function
$ \lambda = \int{P\ell^{3}}{3EI} + \frac{P\ell}{GkA} $
goodness-of-fit test, QQ-plot, KS-plot
- Brown, E. N., Kass, R. E., & Mitra, P. P. (2004). Multiple neural spike train data analysis: state-of-the-art and future challenges. Nature Neuroscience, 7(5), 456–461.
- Kass, R. E., Ventura, V., & Brown, E. N. (2005). Statistical issues in the analysis of neuronal data. Journal of neurophysiology, 94(1), 8–25. doi:10.1152/jn.00648.2004
Time-rescaling theorem
- Brown, E. N., Barbieri, R., Ventura, V., Kass, R. E., & Frank, L. M. (2002). The time-rescaling theorem and its application to neural spike train data analysis. Neural computation, 14(2), 325–46. doi:10.1162/08997660252741149
- Kass, R. E., & Ventura, V. (2001). A spike-train probability model. Neural Computation, 13(8), 1713–1720.
- Kass, R. E., & Raftery, A. E. (1995). Bayes Factors. J Am Stat Assoc, 90(430), 773–795.
- Brown, E. N., Frank, L. M., Tang, D., Quirk, M. C., & Wilson, M. A. (1998). A statistical paradigm for neural spike train decoding applied to position prediction from ensemble firing patterns of rat hippocampal place cells. J Neurosci, 18(18), 7411–7425.
Lecture: Density estimation
- Shimazaki, H., & Shinomoto, S. (2007). A method for selecting the bin size of a time histogram. Neural Computation, 19(6), 1503–1527. doi:10.1162/neco.2007.19.6.1503
- Shimazaki, H., & Shinomoto, S. (2010). Kernel bandwidth optimization in spike rate estimation. Journal of Computational Neuroscience, 29(1-2), 171–182. Retrieved from http://dx.doi.org/10.1007/s10827-009-0180-4
Further readings
- Kass, R. E., Ventura, V., & Cai, C. (2003). Statistical smoothing of neuronal data. Network-Computation in Neural Systems, 14(1), 5–15.
Bayesian statistics
Uncertainty = randomness in the system
Uncertainty = loss of information
Lecture: State-space model
Limitation
Higher-order interactions
Sparseness and higher-order interactions
- Ohiorhenuan, I. E., Mechler, F., Purpura, K. P., Schmid, A. M., Hu, Q., & Victor, J. D. (2010). Sparse coding and high-order correlations in fine-scale cortical networks. Nature, 466(7306), 617–621. doi:10.1038/nature09178
Time-varying higher-order interactions
- Shimazaki, H., Amari, S.-I., Brown, E. N., & Grun, S. (2009). State-space analysis on time-varying correlations in parallel spike sequences. IEEE International Conference on Acoustics Speech and Signal Processing (2009) (pp. 3501–3504). Ieee. doi:10.1109/ICASSP.2009.4960380
- Shimazaki, H., Amari, S., Brown, E. N., & Grün, S. (2012). State-Space Analysis of Time-Varying Higher-Order Spike Correlation for Multiple Neural Spike Train Data. (O. Sporns, Ed.)PLoS Computational Biology, 8(3), e1002385. doi:10.1371/journal.pcbi.1002385
- Kass, R. E., Kelly, R. C., & Loh, W.-L. (2011). Assessment of Synchrony in Multiple Neural Spike Trains Using Loglinear Point Process Models. The annals of applied statistics, 5(2B), 1262–1292. doi:10.1214/10-AOAS429
Recent topics of Neuroscience
Furter reading for trial-by-trial variability
- Arieli et al. Dynamics of Ongoing Activity: Explanation of thelarge variablity in evoked cortical responses. Science 1996
- Simultaneous recordings from cat visual cortex of VSD, LFP and spikes. The work shows the variabilities across trials similary appears in all recordings: the ongoing activity is consistent in different spatial scales. A simple linear model, initial state of VSD + response, predicted a measured response at each trial.
- Tsodyks, M., Kenet, T., Grinvald, A., & Arieli, A. (1999). Linking Spontaneous Activity of Single Cortical Neurons and the Underlying Functional Architecture. Science, 286(December), 1943–1946.
- Anesthetized cat 17&18. Spontaneous: eyes closed. Evoked: orientation bars. Spike triggered averages of VSD imaging (population activities) were computed using spontaneous and evoked activities. The two were similar.
- Kenet, T., Bibitchkov, D., Tsodyks, M., Grinvald, A., & Arieli, A. (2003). Spontaneously emerging cortical representations of visual attributes, 425(October).
Modling study that incorporates a trial-by-trial variability
Spike-triggered average and covariance
- % Paninski, L. (2003). Convergence properties of three spike-triggered analysis techniques. Network: Computation in Neural Systems, 14(3), 437–464. doi:10.1088/0954-898X/14/3/304
- Estebanez, L., Boustani, S. El, Destexhe, A., & Shulz, D. E. (2012). Correlated input reveals coexisting coding schemes in a sensory cortex. Nature Neuroscience, 15(12). doi:10.1038/nn.3258
Data set
Neural Signal Archive http://www.neuralsignal.org/
CRCNS http://crcns.org/data-sets/
Neural Prediction Challenge http://neuralprediction.berkeley.edu
References
Prof. Liam Paninski course page