tensorflow - CNN -> LSTM network for videos
I have x videos, and each video has a different number of frames, say y(x). The frame size is the same for all videos: 224x224x3. I am passing each frame through a CNN, which outputs a feature vector of size 1024. Now I want to pass these to an LSTM. An LSTM requires batch_size, time_steps, and number_of_features. How should I decide these values? I have two configurations in mind but do not know how I should proceed.
- Should I break 1024 into 32 x 32 to define time_steps and number_of_features, with batch_size as the number of frames?
- Should time_steps correspond to the number of frames, with number_of_features as 1024 and batch_size as (?)
So it depends on the problem you are trying to solve.
Action classification using videos?
If you are trying to predict an action/event from a video, you have to use num_of_frames as time_steps and 1024 as number_of_features; batch_size is the number of videos you want to process together.
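As a rough illustration of that layout, here is a minimal NumPy sketch that stacks per-video CNN features into one array shaped (batch_size, time_steps, 1024). Zero-padding the shorter videos to the longest one is my assumption for handling the different frame counts; the answer does not say how to do that. The helper name `pad_video_features` and the random stand-in features are hypothetical.

```python
import numpy as np

FEATURE_DIM = 1024  # size of the CNN feature vector per frame (from the question)


def pad_video_features(videos):
    """Stack variable-length per-video features into one zero-padded batch.

    videos: list of arrays, each shaped (num_frames_i, FEATURE_DIM).
    Returns (batch, lengths): batch is shaped
    (batch_size, max_frames, FEATURE_DIM); lengths holds the true frame counts.
    """
    lengths = np.array([v.shape[0] for v in videos], dtype=np.int32)
    max_frames = int(lengths.max())
    batch = np.zeros((len(videos), max_frames, FEATURE_DIM), dtype=np.float32)
    for i, v in enumerate(videos):
        # Copy the real frames; the remaining time steps stay zero (padding).
        batch[i, : v.shape[0]] = v
    return batch, lengths


# Hypothetical stand-in for CNN outputs: three videos with 5, 8, and 3 frames.
rng = np.random.default_rng(0)
videos = [rng.standard_normal((n, FEATURE_DIM)).astype(np.float32) for n in (5, 8, 3)]

batch, lengths = pad_video_features(videos)
print(batch.shape)  # (3, 8, 1024) -> (batch_size, time_steps, number_of_features)
print(lengths)      # [5 8 3]
```

Here batch_size is 3 (videos processed together), time_steps is 8 (the longest video), and number_of_features is 1024. The `lengths` array is kept so that the padded time steps can be masked or ignored by whatever LSTM implementation consumes the batch.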
Per-frame object classification?
In that case you can split the 1024 features into 32 x 32 to define time_steps, as in your first configuration.
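A small sketch of what that split looks like in terms of array shapes, assuming the frames of one video are treated as the batch (as in the first configuration of the question). The variable names and the `num_frames = 10` value are hypothetical, and the features are dummy values standing in for real CNN outputs.

```python
import numpy as np

num_frames = 10  # hypothetical number of frames in one video

# Dummy stand-in for the CNN outputs: one 1024-dim feature vector per frame.
frame_features = np.arange(num_frames * 1024, dtype=np.float32).reshape(num_frames, 1024)

# Split each 1024-dim vector into 32 time steps of 32 features each.
# Resulting layout: (batch_size=num_frames, time_steps=32, number_of_features=32).
lstm_input = frame_features.reshape(num_frames, 32, 32)
print(lstm_input.shape)  # (10, 32, 32)
```

The reshape only regroups each frame's feature vector; no values are changed, and each frame remains an independent sequence in the batch.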