我想计算事件/刺激触发平均值的计算效率。
假设我有一个signal
,例如
signal = [random.random() for i in xrange(0, 1000)]
使用n_signal
数据点
n_signal = len(signal)
我知道这个信号是以
的速率采样的Fs = 25000 # Hz
在这种情况下,我知道信号的总时间
T_sec = n_signal / float(Fs)
在特定时间,会发生某些事件,例如
t_events = [0.01, 0.017, 0.018, 0.022, 0.034, 0.0345, 0.03456]
现在我想在这些事件之前的某个时间找到信号,例如
t_bef = 0.001
直到这些事件发生后的某个时间,例如
t_aft = 0.002
一旦我获得了所有这些信号块,我想平均这些。 在过去,我会创建信号的时间向量
t_signal = numpy.linspace(0, T_sec, n_signal)
并查找t_events
中t_signal
的所有索引,例如使用numpy.serachsorted
(Link)
因为我知道信号的采样率,所以这些可以更快地完成,比如
indices = [int(i * Fs) for i in t_events]
这为t_signal
节省了我的记忆,而且我不需要通过整个信号来查找我的索引。
接下来,我将确定有多少数据样本t_bef
和t_aft
对应
nsamples_t_bef = int(t_bef * Fs)
nsamples_t_aft = int(t_aft * Fs)
我会将信号块保存在list
signal_chunks = list()
for i in xrange(0, len(t_events)):
signal_chunks.append(signal[indices[i] - nsamples_t_bef : indices[i] + nsamples_t_aft])
最后我正在平均这些
event_triggered_average = numpy.mean(signal_chunks, axis = 0)
如果我对时间向量感兴趣,我用
计算它t_event_triggered_average = numpy.linspace(-t_signal[nsamples_t_bef], t_signal[nsamples_t_aft], nsamples_t_bef + nsamples_t_aft)
现在我的问题:有一种更有效的计算方法吗?如果我有一个包含许多数据点和许多事件的信号,这个计算可能需要一段时间。 list
是保存这些块的最佳数据结构吗?您知道如何更快地获取数据块吗?也许使用缓冲区?
提前感谢您的意见和建议。
最低工作示例
import numpy
import random
random.seed(0)
signal = [random.random() for i in xrange(0, 1000)]
# sampling rate
Fs = 25000 # Hz
# total time of the signal
n_signal = len(signal)
T_sec = n_signal / float(Fs)
# time of events of interest
t_events = [0.01, 0.017, 0.018, 0.022, 0.034, 0.0345, 0.03456]
# and their corresponding indices
indices = [int(i * Fs) for i in t_events]
# define the time window of interest around each event
t_bef = 0.001
t_aft = 0.002
# and the corresponding index offset
nsamples_t_bef = int(t_bef * Fs)
nsamples_t_aft = int(t_aft * Fs)
# vector of signal times
t_signal = numpy.linspace(0, T_sec, n_signal)
signal_chunks = list()
for i in xrange(0, len(t_events)):
signal_chunks.append(signal[indices[i] - nsamples_t_bef : indices[i] + nsamples_t_aft])
# average signal value across chunks
event_triggered_average = numpy.mean(signal_chunks, axis = 0)
# not sure what's going on here
t_event_triggered_average = numpy.linspace(-t_signal[nsamples_t_bef],
t_signal[nsamples_t_aft],
nsamples_t_bef + nsamples_t_aft)
答案 0 :(得分:3)
由于您的信号是在常规网格上定义的,因此您可以执行一些算术来查找所需的所有样本的索引。然后,您可以使用单个索引操作构建具有块的数组。
import numpy as np
# Making some test data
n_signal = 1000
signal = np.random.rand(n_signal)
Fs = 25000 # Hz
t_events = np.array([0.01, 0.017, 0.018, 0.022, 0.034, 0.0345, 0.03456])
# Preferences
t_bef = 0.001
t_aft = 0.002
# The number of samples in a chunk
nsamples = int((t_bef+t_aft) * Fs)
# Create a vector from 0 up to nsamples
sample_idx = np.arange(nsamples)
# Calculate the index of the first sample for each chunk
# Require integers, because it will be used for indexing
start_idx = ((t_events - t_bef) * Fs).astype(int)
# Use broadcasting to create an array with indices
# Each row contains consecutive indices for each chunk
idx = start_idx[:, None] + sample_idx[None, :]
# Get all the chunks using fancy indexing
signal_chunks = signal[idx]
# Calculate the average like you did earlier
event_triggered_average = signal_chunks.mean(axis=0)
注意,.astype(int)
的行不会舍入到最接近的整数,而是向零舍入。