2024-12-12 04:48 AM
As stated, I have a problem analyzing an ONNX model exported from a PyTorch model. The model code is as simple as follows:
import torch
import torch.nn as nn

class LSTM(nn.Module):
    def __init__(self, input_size, hidden_size, num_layers, num_output=1):
        super(LSTM, self).__init__()
        self.num_layers = num_layers
        self.input_size = input_size
        self.hidden_size = hidden_size
        self.num_output = num_output
        self.lstm = nn.LSTM(input_size=input_size, hidden_size=hidden_size,
                            num_layers=num_layers, batch_first=True)
        self.fc = nn.Linear(hidden_size, num_output)

    def forward(self, x, device='cuda'):
        # take the hidden state of the last LSTM layer and map it to the output size
        ula, (h_out, _) = self.lstm(x)
        out = self.fc(h_out[-1])
        return out
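For reference, the export to ONNX is done with a call along these lines (the parameter values, dummy input shape, and file name below are placeholders, not necessarily the exact ones I used):

model = LSTM(input_size=10, hidden_size=64, num_layers=2, num_output=1)  # placeholder sizes
model.eval()
dummy_input = torch.randn(1, 128, 10)  # (batch=1, timesteps, features)
torch.onnx.export(model, dummy_input, "lstm.onnx",
                  input_names=['input'], output_names=['output'])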
Netron visualization:
I would really appreciate any advice or workarounds. Thank you!
2024-12-12 07:35 AM
Hello @Lemanoise,
Could you attach your ONNX model in a .zip file, please?
In the meantime, make sure you follow the conditions described in the ONNX Toolbox documentation concerning the LSTM operator:
- Computes a multi-layer long short-term memory (LSTM) RNN on an input sequence of shape (batch=1, timesteps, features)
- Specific constraints/recommendations: return_state is not supported
https://stedgeai-dc.st.com/assets/embedded-docs/supported_ops_onnx.html
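If it helps, a quick way to confirm that the input shape of your exported model matches (batch=1, timesteps, features) is the following sketch (assumes the onnx Python package is installed; the file name is a placeholder):

import onnx

# Print the name and shape of every graph input of the exported model
model = onnx.load("lstm.onnx")  # placeholder path
for inp in model.graph.input:
    dims = [d.dim_value if d.HasField("dim_value") else d.dim_param
            for d in inp.type.tensor_type.shape.dim]
    print(inp.name, dims)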
Have a good day,
Julian
2024-12-13 06:54 AM
2024-12-15 11:30 PM
Hello, I have the same question. Did you solve it?
2024-12-16 12:55 AM
No, not yet. I hope the ST staff will check my model and give us further guidance.
2024-12-16 01:28 AM
I read an answer saying that X-Cube-AI version 10.0.0 solves this issue, but I have not tried it yet.
2024-12-16 01:49 AM
I tried X-Cube-AI 10.0.0 and also the new ST Edge AI Developer Cloud; they did not solve my problem.
2024-12-16 05:26 AM
Hi everyone,
I did some tests on my side, and we currently only support LSTMs with a hidden size of 1 and a single layer. Something like this:
import torch
import torch.nn as nn

# Simple LSTM that converts successfully
class LSTM(nn.Module):
    def __init__(self, input_size, hidden_size, num_layers, num_output=1):
        super(LSTM, self).__init__()
        self.num_layers = num_layers
        self.input_size = input_size
        self.hidden_size = hidden_size
        self.num_output = num_output
        self.lstm = nn.LSTM(input_size=input_size, hidden_size=hidden_size,
                            num_layers=num_layers, batch_first=True, bidirectional=False)
        self.fc = nn.Linear(hidden_size, num_output)

    def forward(self, x, device='cuda'):
        ula, (h_out, _) = self.lstm(x)
        out = self.fc(h_out[-1])
        return out

# hidden_size and num_layers must be 1, otherwise the conversion fails;
# the other parameters can be anything
input_size = 10
hidden_size = 1
num_layers = 1
num_output = 10

model = LSTM(input_size, hidden_size, num_layers, num_output)
model.eval()
dummy_input = torch.randn(1, 128, input_size)
onnx_file_path = "simple_lstml.onnx"
torch.onnx.export(model, dummy_input, onnx_file_path,
                  input_names=['input'], output_names=['output'])
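To quickly sanity-check the exported file, you can run it once with onnxruntime (a minimal sketch, assuming onnxruntime is installed; the input name matches the export above):

import numpy as np
import onnxruntime as ort

# Run the exported model once on a random input of the expected shape
sess = ort.InferenceSession("simple_lstml.onnx")
x = np.random.randn(1, 128, 10).astype(np.float32)  # (batch=1, timesteps, features)
out = sess.run(None, {"input": x})
print(out[0].shape)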
I tried to stack multiple LSTM layers but had no success so far.
I am asking internally for more information and will keep you updated.
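For illustration, by "stacking" I mean configurations along these lines (the exact parameters are illustrative only):

import torch.nn as nn

# Variant 1: several layers inside a single nn.LSTM
stacked = nn.LSTM(input_size=10, hidden_size=1, num_layers=2, batch_first=True)

# Variant 2: two separate nn.LSTM modules chained in forward()
lstm_a = nn.LSTM(input_size=10, hidden_size=1, batch_first=True)
lstm_b = nn.LSTM(input_size=1, hidden_size=1, batch_first=True)
# y, _ = lstm_a(x); out, _ = lstm_b(y)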
Have a good day,
Julian
2024-12-16 07:35 AM
Thank you! I have tried the one-layer case and it worked. Please keep sharing any further information on the multi-layer case :)
2024-12-16 08:32 AM
Hello @Lemanoise ,
Using TensorFlow, I was able to stack multiple LSTM layers one after the other (which I did not manage to do with PyTorch).
It may be useful for you:
import tensorflow as tf
from tensorflow.keras.models import Model
from tensorflow.keras.layers import Input, LSTM, Dense

input_layer = Input(shape=(100, 2), batch_size=1)
lstm1 = LSTM(32, activation='relu', stateful=True, return_sequences=True)(input_layer)
lstm2 = LSTM(32, activation='relu', stateful=True, return_sequences=True)(lstm1)
output_layer = Dense(units=3, activation='linear')(lstm2)

# Create the model
model = Model(inputs=input_layer, outputs=output_layer)

# Print the model summary
model.summary()

# Compile the model
model.compile(optimizer='adam', loss='mean_squared_error')
model.save('tf_lstm.h5')
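Since the LSTMs are stateful with batch_size=1, the hidden state carries over between predict() calls, so a long stream can be fed chunk by chunk, roughly like this (usage sketch only; shapes follow the model above):

import numpy as np

# Feed a long stream chunk by chunk; the LSTM states persist between calls
for _ in range(5):
    chunk = np.random.randn(1, 100, 2).astype(np.float32)  # (batch=1, timesteps, features)
    y = model.predict(chunk, verbose=0)

# Reset the internal states before starting a new, independent sequence
model.reset_states()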
Be careful to follow these instructions while using it:
https://stedgeai-dc.st.com/assets/embedded-docs/keras_lstm_stateful.html
Have a good day,
Julian