-
Notifications
You must be signed in to change notification settings - Fork 222
size mismatch for base model #39
Copy link
Copy link
Open
Description
Hi, when I use the provided base_model to make an inference, during initialization, it report an error
size mismatch for blocks.xx.action_model.keyboard_embed.0.weight: copying a param with shape torch.Size([128, 6]) from checkpoint, the shape in current model is torch.Size([128, 4]).
What do these two extra dimensions represent?
Could you please guide me on how to resolve this issue?
I truly appreciate your help.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels