I have my own pretrained Pegasus model and now want to fine-tune it using BigBird. This is my variable mapping:
OrderedDict([('decoder/LayerNorm/beta',
<tf.Variable 'pegasus/decoder/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('decoder/LayerNorm/gamma',
<tf.Variable 'pegasus/decoder/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_0/attention/self/LayerNorm/beta',
<tf.Variable 'pegasus/decoder/layer_0/attention/self/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_0/attention/self/LayerNorm/gamma',
<tf.Variable 'pegasus/decoder/layer_0/attention/self/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_0/attention/self/key/kernel',
<tf.Variable 'pegasus/decoder/layer_0/attention/self/key/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_0/attention/self/output/dense/kernel',
<tf.Variable 'pegasus/decoder/layer_0/attention/output/dense/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_0/attention/self/query/kernel',
<tf.Variable 'pegasus/decoder/layer_0/attention/self/query/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_0/attention/self/value/kernel',
<tf.Variable 'pegasus/decoder/layer_0/attention/self/value/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_0/ffn/LayerNorm/beta',
<tf.Variable 'pegasus/decoder/layer_0/intermediate/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_0/ffn/LayerNorm/gamma',
<tf.Variable 'pegasus/decoder/layer_0/intermediate/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_0/ffn/dense/bias',
<tf.Variable 'pegasus/decoder/layer_0/intermediate/dense/bias:0' shape=(3072,) dtype=float32_ref>),
('decoder/layer_0/ffn/dense/kernel',
<tf.Variable 'pegasus/decoder/layer_0/intermediate/dense/kernel:0' shape=(512, 3072) dtype=float32_ref>),
('decoder/layer_0/ffn/dense_1/bias',
<tf.Variable 'pegasus/decoder/layer_0/output/dense/bias:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_0/ffn/dense_1/kernel',
<tf.Variable 'pegasus/decoder/layer_0/output/dense/kernel:0' shape=(3072, 512) dtype=float32_ref>),
('decoder/layer_0/memory_attention/LayerNorm/beta',
<tf.Variable 'pegasus/decoder/layer_0/attention/encdec/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_0/memory_attention/LayerNorm/gamma',
<tf.Variable 'pegasus/decoder/layer_0/attention/encdec/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_0/memory_attention/key/kernel',
<tf.Variable 'pegasus/decoder/layer_0/attention/encdec/key/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_0/memory_attention/output/dense/kernel',
<tf.Variable 'pegasus/decoder/layer_0/attention/encdec_output/dense/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_0/memory_attention/query/kernel',
<tf.Variable 'pegasus/decoder/layer_0/attention/encdec/query/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_0/memory_attention/value/kernel',
<tf.Variable 'pegasus/decoder/layer_0/attention/encdec/value/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_1/attention/self/LayerNorm/beta',
<tf.Variable 'pegasus/decoder/layer_1/attention/self/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_1/attention/self/LayerNorm/gamma',
<tf.Variable 'pegasus/decoder/layer_1/attention/self/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_1/attention/self/key/kernel',
<tf.Variable 'pegasus/decoder/layer_1/attention/self/key/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_1/attention/self/output/dense/kernel',
<tf.Variable 'pegasus/decoder/layer_1/attention/output/dense/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_1/attention/self/query/kernel',
<tf.Variable 'pegasus/decoder/layer_1/attention/self/query/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_1/attention/self/value/kernel',
<tf.Variable 'pegasus/decoder/layer_1/attention/self/value/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_1/ffn/LayerNorm/beta',
<tf.Variable 'pegasus/decoder/layer_1/intermediate/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_1/ffn/LayerNorm/gamma',
<tf.Variable 'pegasus/decoder/layer_1/intermediate/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_1/ffn/dense/bias',
<tf.Variable 'pegasus/decoder/layer_1/intermediate/dense/bias:0' shape=(3072,) dtype=float32_ref>),
('decoder/layer_1/ffn/dense/kernel',
<tf.Variable 'pegasus/decoder/layer_1/intermediate/dense/kernel:0' shape=(512, 3072) dtype=float32_ref>),
('decoder/layer_1/ffn/dense_1/bias',
<tf.Variable 'pegasus/decoder/layer_1/output/dense/bias:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_1/ffn/dense_1/kernel',
<tf.Variable 'pegasus/decoder/layer_1/output/dense/kernel:0' shape=(3072, 512) dtype=float32_ref>),
('decoder/layer_1/memory_attention/LayerNorm/beta',
<tf.Variable 'pegasus/decoder/layer_1/attention/encdec/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_1/memory_attention/LayerNorm/gamma',
<tf.Variable 'pegasus/decoder/layer_1/attention/encdec/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_1/memory_attention/key/kernel',
<tf.Variable 'pegasus/decoder/layer_1/attention/encdec/key/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_1/memory_attention/output/dense/kernel',
<tf.Variable 'pegasus/decoder/layer_1/attention/encdec_output/dense/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_1/memory_attention/query/kernel',
<tf.Variable 'pegasus/decoder/layer_1/attention/encdec/query/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_1/memory_attention/value/kernel',
<tf.Variable 'pegasus/decoder/layer_1/attention/encdec/value/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_2/attention/self/LayerNorm/beta',
<tf.Variable 'pegasus/decoder/layer_2/attention/self/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_2/attention/self/LayerNorm/gamma',
<tf.Variable 'pegasus/decoder/layer_2/attention/self/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_2/attention/self/key/kernel',
<tf.Variable 'pegasus/decoder/layer_2/attention/self/key/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_2/attention/self/output/dense/kernel',
<tf.Variable 'pegasus/decoder/layer_2/attention/output/dense/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_2/attention/self/query/kernel',
<tf.Variable 'pegasus/decoder/layer_2/attention/self/query/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_2/attention/self/value/kernel',
<tf.Variable 'pegasus/decoder/layer_2/attention/self/value/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_2/ffn/LayerNorm/beta',
<tf.Variable 'pegasus/decoder/layer_2/intermediate/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_2/ffn/LayerNorm/gamma',
<tf.Variable 'pegasus/decoder/layer_2/intermediate/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_2/ffn/dense/bias',
<tf.Variable 'pegasus/decoder/layer_2/intermediate/dense/bias:0' shape=(3072,) dtype=float32_ref>),
('decoder/layer_2/ffn/dense/kernel',
<tf.Variable 'pegasus/decoder/layer_2/intermediate/dense/kernel:0' shape=(512, 3072) dtype=float32_ref>),
('decoder/layer_2/ffn/dense_1/bias',
<tf.Variable 'pegasus/decoder/layer_2/output/dense/bias:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_2/ffn/dense_1/kernel',
<tf.Variable 'pegasus/decoder/layer_2/output/dense/kernel:0' shape=(3072, 512) dtype=float32_ref>),
('decoder/layer_2/memory_attention/LayerNorm/beta',
<tf.Variable 'pegasus/decoder/layer_2/attention/encdec/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_2/memory_attention/LayerNorm/gamma',
<tf.Variable 'pegasus/decoder/layer_2/attention/encdec/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_2/memory_attention/key/kernel',
<tf.Variable 'pegasus/decoder/layer_2/attention/encdec/key/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_2/memory_attention/output/dense/kernel',
<tf.Variable 'pegasus/decoder/layer_2/attention/encdec_output/dense/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_2/memory_attention/query/kernel',
<tf.Variable 'pegasus/decoder/layer_2/attention/encdec/query/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_2/memory_attention/value/kernel',
<tf.Variable 'pegasus/decoder/layer_2/attention/encdec/value/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_3/attention/self/LayerNorm/beta',
<tf.Variable 'pegasus/decoder/layer_3/attention/self/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_3/attention/self/LayerNorm/gamma',
<tf.Variable 'pegasus/decoder/layer_3/attention/self/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_3/attention/self/key/kernel',
<tf.Variable 'pegasus/decoder/layer_3/attention/self/key/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_3/attention/self/output/dense/kernel',
<tf.Variable 'pegasus/decoder/layer_3/attention/output/dense/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_3/attention/self/query/kernel',
<tf.Variable 'pegasus/decoder/layer_3/attention/self/query/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_3/attention/self/value/kernel',
<tf.Variable 'pegasus/decoder/layer_3/attention/self/value/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_3/ffn/LayerNorm/beta',
<tf.Variable 'pegasus/decoder/layer_3/intermediate/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_3/ffn/LayerNorm/gamma',
<tf.Variable 'pegasus/decoder/layer_3/intermediate/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_3/ffn/dense/bias',
<tf.Variable 'pegasus/decoder/layer_3/intermediate/dense/bias:0' shape=(3072,) dtype=float32_ref>),
('decoder/layer_3/ffn/dense/kernel',
<tf.Variable 'pegasus/decoder/layer_3/intermediate/dense/kernel:0' shape=(512, 3072) dtype=float32_ref>),
('decoder/layer_3/ffn/dense_1/bias',
<tf.Variable 'pegasus/decoder/layer_3/output/dense/bias:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_3/ffn/dense_1/kernel',
<tf.Variable 'pegasus/decoder/layer_3/output/dense/kernel:0' shape=(3072, 512) dtype=float32_ref>),
('decoder/layer_3/memory_attention/LayerNorm/beta',
<tf.Variable 'pegasus/decoder/layer_3/attention/encdec/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_3/memory_attention/LayerNorm/gamma',
<tf.Variable 'pegasus/decoder/layer_3/attention/encdec/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_3/memory_attention/key/kernel',
<tf.Variable 'pegasus/decoder/layer_3/attention/encdec/key/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_3/memory_attention/output/dense/kernel',
<tf.Variable 'pegasus/decoder/layer_3/attention/encdec_output/dense/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_3/memory_attention/query/kernel',
<tf.Variable 'pegasus/decoder/layer_3/attention/encdec/query/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_3/memory_attention/value/kernel',
<tf.Variable 'pegasus/decoder/layer_3/attention/encdec/value/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_4/attention/self/LayerNorm/beta',
<tf.Variable 'pegasus/decoder/layer_4/attention/self/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_4/attention/self/LayerNorm/gamma',
<tf.Variable 'pegasus/decoder/layer_4/attention/self/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_4/attention/self/key/kernel',
<tf.Variable 'pegasus/decoder/layer_4/attention/self/key/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_4/attention/self/output/dense/kernel',
<tf.Variable 'pegasus/decoder/layer_4/attention/output/dense/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_4/attention/self/query/kernel',
<tf.Variable 'pegasus/decoder/layer_4/attention/self/query/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_4/attention/self/value/kernel',
<tf.Variable 'pegasus/decoder/layer_4/attention/self/value/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_4/ffn/LayerNorm/beta',
<tf.Variable 'pegasus/decoder/layer_4/intermediate/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_4/ffn/LayerNorm/gamma',
<tf.Variable 'pegasus/decoder/layer_4/intermediate/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_4/ffn/dense/bias',
<tf.Variable 'pegasus/decoder/layer_4/intermediate/dense/bias:0' shape=(3072,) dtype=float32_ref>),
('decoder/layer_4/ffn/dense/kernel',
<tf.Variable 'pegasus/decoder/layer_4/intermediate/dense/kernel:0' shape=(512, 3072) dtype=float32_ref>),
('decoder/layer_4/ffn/dense_1/bias',
<tf.Variable 'pegasus/decoder/layer_4/output/dense/bias:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_4/ffn/dense_1/kernel',
<tf.Variable 'pegasus/decoder/layer_4/output/dense/kernel:0' shape=(3072, 512) dtype=float32_ref>),
('decoder/layer_4/memory_attention/LayerNorm/beta',
<tf.Variable 'pegasus/decoder/layer_4/attention/encdec/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_4/memory_attention/LayerNorm/gamma',
<tf.Variable 'pegasus/decoder/layer_4/attention/encdec/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_4/memory_attention/key/kernel',
<tf.Variable 'pegasus/decoder/layer_4/attention/encdec/key/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_4/memory_attention/output/dense/kernel',
<tf.Variable 'pegasus/decoder/layer_4/attention/encdec_output/dense/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_4/memory_attention/query/kernel',
<tf.Variable 'pegasus/decoder/layer_4/attention/encdec/query/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_4/memory_attention/value/kernel',
<tf.Variable 'pegasus/decoder/layer_4/attention/encdec/value/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_5/attention/self/LayerNorm/beta',
<tf.Variable 'pegasus/decoder/layer_5/attention/self/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_5/attention/self/LayerNorm/gamma',
<tf.Variable 'pegasus/decoder/layer_5/attention/self/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_5/attention/self/key/kernel',
<tf.Variable 'pegasus/decoder/layer_5/attention/self/key/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_5/attention/self/output/dense/kernel',
<tf.Variable 'pegasus/decoder/layer_5/attention/output/dense/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_5/attention/self/query/kernel',
<tf.Variable 'pegasus/decoder/layer_5/attention/self/query/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_5/attention/self/value/kernel',
<tf.Variable 'pegasus/decoder/layer_5/attention/self/value/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_5/ffn/LayerNorm/beta',
<tf.Variable 'pegasus/decoder/layer_5/intermediate/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_5/ffn/LayerNorm/gamma',
<tf.Variable 'pegasus/decoder/layer_5/intermediate/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_5/ffn/dense/bias',
<tf.Variable 'pegasus/decoder/layer_5/intermediate/dense/bias:0' shape=(3072,) dtype=float32_ref>),
('decoder/layer_5/ffn/dense/kernel',
<tf.Variable 'pegasus/decoder/layer_5/intermediate/dense/kernel:0' shape=(512, 3072) dtype=float32_ref>),
('decoder/layer_5/ffn/dense_1/bias',
<tf.Variable 'pegasus/decoder/layer_5/output/dense/bias:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_5/ffn/dense_1/kernel',
<tf.Variable 'pegasus/decoder/layer_5/output/dense/kernel:0' shape=(3072, 512) dtype=float32_ref>),
('decoder/layer_5/memory_attention/LayerNorm/beta',
<tf.Variable 'pegasus/decoder/layer_5/attention/encdec/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_5/memory_attention/LayerNorm/gamma',
<tf.Variable 'pegasus/decoder/layer_5/attention/encdec/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('decoder/layer_5/memory_attention/key/kernel',
<tf.Variable 'pegasus/decoder/layer_5/attention/encdec/key/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_5/memory_attention/output/dense/kernel',
<tf.Variable 'pegasus/decoder/layer_5/attention/encdec_output/dense/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_5/memory_attention/query/kernel',
<tf.Variable 'pegasus/decoder/layer_5/attention/encdec/query/kernel:0' shape=(512, 512) dtype=float32_ref>),
('decoder/layer_5/memory_attention/value/kernel',
<tf.Variable 'pegasus/decoder/layer_5/attention/encdec/value/kernel:0' shape=(512, 512) dtype=float32_ref>),
('embeddings/weights',
<tf.Variable 'pegasus/embeddings/word_embeddings:0' shape=(32128, 512) dtype=float32_ref>),
('encoder/LayerNorm/beta',
<tf.Variable 'pegasus/encoder/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('encoder/LayerNorm/gamma',
<tf.Variable 'pegasus/encoder/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_0/attention/self/LayerNorm/beta',
<tf.Variable 'pegasus/encoder/layer_0/attention/self/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_0/attention/self/LayerNorm/gamma',
<tf.Variable 'pegasus/encoder/layer_0/attention/self/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_0/attention/self/key/kernel',
<tf.Variable 'pegasus/encoder/layer_0/attention/self/key/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_0/attention/self/output/dense/kernel',
<tf.Variable 'pegasus/encoder/layer_0/attention/output/dense/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_0/attention/self/query/kernel',
<tf.Variable 'pegasus/encoder/layer_0/attention/self/query/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_0/attention/self/value/kernel',
<tf.Variable 'pegasus/encoder/layer_0/attention/self/value/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_0/ffn/LayerNorm/beta',
<tf.Variable 'pegasus/encoder/layer_0/intermediate/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_0/ffn/LayerNorm/gamma',
<tf.Variable 'pegasus/encoder/layer_0/intermediate/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_0/ffn/dense/bias',
<tf.Variable 'pegasus/encoder/layer_0/intermediate/dense/bias:0' shape=(3072,) dtype=float32_ref>),
('encoder/layer_0/ffn/dense/kernel',
<tf.Variable 'pegasus/encoder/layer_0/intermediate/dense/kernel:0' shape=(512, 3072) dtype=float32_ref>),
('encoder/layer_0/ffn/dense_1/bias',
<tf.Variable 'pegasus/encoder/layer_0/output/dense/bias:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_0/ffn/dense_1/kernel',
<tf.Variable 'pegasus/encoder/layer_0/output/dense/kernel:0' shape=(3072, 512) dtype=float32_ref>),
('encoder/layer_1/attention/self/LayerNorm/beta',
<tf.Variable 'pegasus/encoder/layer_1/attention/self/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_1/attention/self/LayerNorm/gamma',
<tf.Variable 'pegasus/encoder/layer_1/attention/self/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_1/attention/self/key/kernel',
<tf.Variable 'pegasus/encoder/layer_1/attention/self/key/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_1/attention/self/output/dense/kernel',
<tf.Variable 'pegasus/encoder/layer_1/attention/output/dense/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_1/attention/self/query/kernel',
<tf.Variable 'pegasus/encoder/layer_1/attention/self/query/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_1/attention/self/value/kernel',
<tf.Variable 'pegasus/encoder/layer_1/attention/self/value/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_1/ffn/LayerNorm/beta',
<tf.Variable 'pegasus/encoder/layer_1/intermediate/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_1/ffn/LayerNorm/gamma',
<tf.Variable 'pegasus/encoder/layer_1/intermediate/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_1/ffn/dense/bias',
<tf.Variable 'pegasus/encoder/layer_1/intermediate/dense/bias:0' shape=(3072,) dtype=float32_ref>),
('encoder/layer_1/ffn/dense/kernel',
<tf.Variable 'pegasus/encoder/layer_1/intermediate/dense/kernel:0' shape=(512, 3072) dtype=float32_ref>),
('encoder/layer_1/ffn/dense_1/bias',
<tf.Variable 'pegasus/encoder/layer_1/output/dense/bias:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_1/ffn/dense_1/kernel',
<tf.Variable 'pegasus/encoder/layer_1/output/dense/kernel:0' shape=(3072, 512) dtype=float32_ref>),
('encoder/layer_2/attention/self/LayerNorm/beta',
<tf.Variable 'pegasus/encoder/layer_2/attention/self/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_2/attention/self/LayerNorm/gamma',
<tf.Variable 'pegasus/encoder/layer_2/attention/self/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_2/attention/self/key/kernel',
<tf.Variable 'pegasus/encoder/layer_2/attention/self/key/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_2/attention/self/output/dense/kernel',
<tf.Variable 'pegasus/encoder/layer_2/attention/output/dense/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_2/attention/self/query/kernel',
<tf.Variable 'pegasus/encoder/layer_2/attention/self/query/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_2/attention/self/value/kernel',
<tf.Variable 'pegasus/encoder/layer_2/attention/self/value/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_2/ffn/LayerNorm/beta',
<tf.Variable 'pegasus/encoder/layer_2/intermediate/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_2/ffn/LayerNorm/gamma',
<tf.Variable 'pegasus/encoder/layer_2/intermediate/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_2/ffn/dense/bias',
<tf.Variable 'pegasus/encoder/layer_2/intermediate/dense/bias:0' shape=(3072,) dtype=float32_ref>),
('encoder/layer_2/ffn/dense/kernel',
<tf.Variable 'pegasus/encoder/layer_2/intermediate/dense/kernel:0' shape=(512, 3072) dtype=float32_ref>),
('encoder/layer_2/ffn/dense_1/bias',
<tf.Variable 'pegasus/encoder/layer_2/output/dense/bias:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_2/ffn/dense_1/kernel',
<tf.Variable 'pegasus/encoder/layer_2/output/dense/kernel:0' shape=(3072, 512) dtype=float32_ref>),
('encoder/layer_3/attention/self/LayerNorm/beta',
<tf.Variable 'pegasus/encoder/layer_3/attention/self/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_3/attention/self/LayerNorm/gamma',
<tf.Variable 'pegasus/encoder/layer_3/attention/self/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_3/attention/self/key/kernel',
<tf.Variable 'pegasus/encoder/layer_3/attention/self/key/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_3/attention/self/output/dense/kernel',
<tf.Variable 'pegasus/encoder/layer_3/attention/output/dense/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_3/attention/self/query/kernel',
<tf.Variable 'pegasus/encoder/layer_3/attention/self/query/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_3/attention/self/value/kernel',
<tf.Variable 'pegasus/encoder/layer_3/attention/self/value/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_3/ffn/LayerNorm/beta',
<tf.Variable 'pegasus/encoder/layer_3/intermediate/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_3/ffn/LayerNorm/gamma',
<tf.Variable 'pegasus/encoder/layer_3/intermediate/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_3/ffn/dense/bias',
<tf.Variable 'pegasus/encoder/layer_3/intermediate/dense/bias:0' shape=(3072,) dtype=float32_ref>),
('encoder/layer_3/ffn/dense/kernel',
<tf.Variable 'pegasus/encoder/layer_3/intermediate/dense/kernel:0' shape=(512, 3072) dtype=float32_ref>),
('encoder/layer_3/ffn/dense_1/bias',
<tf.Variable 'pegasus/encoder/layer_3/output/dense/bias:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_3/ffn/dense_1/kernel',
<tf.Variable 'pegasus/encoder/layer_3/output/dense/kernel:0' shape=(3072, 512) dtype=float32_ref>),
('encoder/layer_4/attention/self/LayerNorm/beta',
<tf.Variable 'pegasus/encoder/layer_4/attention/self/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_4/attention/self/LayerNorm/gamma',
<tf.Variable 'pegasus/encoder/layer_4/attention/self/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_4/attention/self/key/kernel',
<tf.Variable 'pegasus/encoder/layer_4/attention/self/key/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_4/attention/self/output/dense/kernel',
<tf.Variable 'pegasus/encoder/layer_4/attention/output/dense/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_4/attention/self/query/kernel',
<tf.Variable 'pegasus/encoder/layer_4/attention/self/query/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_4/attention/self/value/kernel',
<tf.Variable 'pegasus/encoder/layer_4/attention/self/value/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_4/ffn/LayerNorm/beta',
<tf.Variable 'pegasus/encoder/layer_4/intermediate/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_4/ffn/LayerNorm/gamma',
<tf.Variable 'pegasus/encoder/layer_4/intermediate/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_4/ffn/dense/bias',
<tf.Variable 'pegasus/encoder/layer_4/intermediate/dense/bias:0' shape=(3072,) dtype=float32_ref>),
('encoder/layer_4/ffn/dense/kernel',
<tf.Variable 'pegasus/encoder/layer_4/intermediate/dense/kernel:0' shape=(512, 3072) dtype=float32_ref>),
('encoder/layer_4/ffn/dense_1/bias',
<tf.Variable 'pegasus/encoder/layer_4/output/dense/bias:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_4/ffn/dense_1/kernel',
<tf.Variable 'pegasus/encoder/layer_4/output/dense/kernel:0' shape=(3072, 512) dtype=float32_ref>),
('encoder/layer_5/attention/self/LayerNorm/beta',
<tf.Variable 'pegasus/encoder/layer_5/attention/self/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_5/attention/self/LayerNorm/gamma',
<tf.Variable 'pegasus/encoder/layer_5/attention/self/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_5/attention/self/key/kernel',
<tf.Variable 'pegasus/encoder/layer_5/attention/self/key/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_5/attention/self/output/dense/kernel',
<tf.Variable 'pegasus/encoder/layer_5/attention/output/dense/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_5/attention/self/query/kernel',
<tf.Variable 'pegasus/encoder/layer_5/attention/self/query/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_5/attention/self/value/kernel',
<tf.Variable 'pegasus/encoder/layer_5/attention/self/value/kernel:0' shape=(512, 512) dtype=float32_ref>),
('encoder/layer_5/ffn/LayerNorm/beta',
<tf.Variable 'pegasus/encoder/layer_5/intermediate/LayerNorm/beta:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_5/ffn/LayerNorm/gamma',
<tf.Variable 'pegasus/encoder/layer_5/intermediate/LayerNorm/gamma:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_5/ffn/dense/bias',
<tf.Variable 'pegasus/encoder/layer_5/intermediate/dense/bias:0' shape=(3072,) dtype=float32_ref>),
('encoder/layer_5/ffn/dense/kernel',
<tf.Variable 'pegasus/encoder/layer_5/intermediate/dense/kernel:0' shape=(512, 3072) dtype=float32_ref>),
('encoder/layer_5/ffn/dense_1/bias',
<tf.Variable 'pegasus/encoder/layer_5/output/dense/bias:0' shape=(512,) dtype=float32_ref>),
('encoder/layer_5/ffn/dense_1/kernel',
<tf.Variable 'pegasus/encoder/layer_5/output/dense/kernel:0' shape=(3072, 512) dtype=float32_ref>)])
I'm not sure this mapping is correct, and fine-tuning is really slow, so any guidance on variable mapping would be really helpful.
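One way to sanity-check a mapping like this is to compare each key against the names and shapes stored in the checkpoint. The following is only a rough sketch: `check_assignment_map` is a hypothetical helper, the toy shapes are hand-made for illustration (including a `global_step` entry that may not exist in a real checkpoint), and it assumes the keys are checkpoint names and the values are the variables to be initialized.

```python
def check_assignment_map(ckpt_shapes, target_shapes):
    """Compare a checkpoint-name -> variable mapping against the checkpoint.

    ckpt_shapes:   {checkpoint_name: shape} for every variable in the checkpoint.
    target_shapes: {checkpoint_name: shape of the variable it is mapped to}.
    Returns (missing, mismatched, unused).
    """
    # Keys in the mapping that the checkpoint does not contain.
    missing = sorted(k for k in target_shapes if k not in ckpt_shapes)
    # Keys present in both, but with different shapes.
    mismatched = sorted(
        (k, tuple(ckpt_shapes[k]), tuple(target_shapes[k]))
        for k in target_shapes
        if k in ckpt_shapes and tuple(ckpt_shapes[k]) != tuple(target_shapes[k])
    )
    # Checkpoint variables that nothing is mapped from.
    unused = sorted(k for k in ckpt_shapes if k not in target_shapes)
    return missing, mismatched, unused


# With TensorFlow, the two inputs could be built roughly like this
# (ckpt_path and assignment_map are placeholders for your own objects):
#   ckpt_shapes = dict(tf.train.list_variables(ckpt_path))
#   target_shapes = {k: v.shape.as_list() for k, v in assignment_map.items()}

# Tiny hand-made example using a few names from the mapping above.
ckpt_shapes = {
    "embeddings/weights": [32128, 512],
    "encoder/layer_0/ffn/dense/kernel": [512, 3072],
    "global_step": [],  # hypothetical extra checkpoint variable
}
target_shapes = {
    "embeddings/weights": [32128, 512],
    "encoder/layer_0/ffn/dense/kernel": [3072, 512],  # deliberately wrong
    "encoder/layer_0/ffn/dense/bias": [3072],         # absent from the toy checkpoint
}

missing, mismatched, unused = check_assignment_map(ckpt_shapes, target_shapes)
print("missing from checkpoint:", missing)
print("shape mismatches:", mismatched)
print("checkpoint variables not mapped:", unused)
```

If all three lists come back empty apart from bookkeeping variables, the names and shapes at least line up; it does not prove that each weight is mapped to the semantically right layer.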