Comments (7)

jzyustc commented on August 23, 2024

Hi, I don't know the exact settings of your training, so I will describe the details of mine.

I train the diffusion model with Combined noise layers, including Jpeg and Crop, as described in the paper. The message length is 30 bits, and after the fully connected layer it is 256-dimensional. The image size is 128x128. The batch size is 16, and I train on one NVIDIA 2080Ti. The training log is uploaded in the /result folder, together with the pre-trained model.

The training is not stable, so I apply early stopping based on the validation result. In fact, the validation BER varies from about 2% to 10%, and I use the best epoch for testing.
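
The selection rule described above can be sketched as follows. This is a minimal illustration, not the repository's code: `bit_error_rate`, `best_epoch`, and the sample BER values are hypothetical names and numbers chosen for the example.

```python
from typing import Sequence


def bit_error_rate(sent: Sequence[int], decoded: Sequence[int]) -> float:
    """Fraction of message bits that were decoded incorrectly."""
    if len(sent) != len(decoded) or len(sent) == 0:
        raise ValueError("messages must be non-empty and of equal length")
    errors = sum(1 for s, d in zip(sent, decoded) if s != d)
    return errors / len(sent)


def best_epoch(validation_ber: Sequence[float]) -> int:
    """Index of the epoch with the lowest validation BER (first one on ties)."""
    if len(validation_ber) == 0:
        raise ValueError("need at least one epoch")
    return min(range(len(validation_ber)), key=lambda e: validation_ber[e])


# Hypothetical validation BER per epoch, fluctuating between 2% and 10%.
history = [0.10, 0.06, 0.02, 0.08, 0.04]
print(best_epoch(history))  # index of the checkpoint that would be used for testing
```

With a fluctuating curve like this, the last epoch is not necessarily the best one, which is why the checkpoint is picked by validation BER rather than by epoch count.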

I hope these details help.

from mbrs.

jzyustc commented on August 23, 2024

Hi, to address the instability of diffusion model training, I have updated the training process to adjust the learning rate dynamically. You can find the details in the README. I hope it helps.

jzyustc commented on August 23, 2024

This issue will stay open, because there might be better methods to solve the instability problem. I would be glad to see more ideas.

SaltyFishhh commented on August 23, 2024

Hi, I don't know the exact settings of your training, so I will describe the details of mine.

I train the diffusion model with Combined noise layers, including Jpeg and Crop, as described in the paper. The message length is 30 bits, and after the fully connected layer it is 256-dimensional. The image size is 128x128. The batch size is 16, and I train on one NVIDIA 2080Ti. The training log is uploaded in the /result folder, together with the pre-trained model.

The training is not stable, so I apply early stopping based on the validation result. In fact, the validation BER varies from about 2% to 10%, and I use the best epoch for testing.

I hope these details help.

I tested the pre-trained model that you provided, and the result is the same as in the paper, but the model that I trained myself does not perform well. I don't know whether my settings are identical to yours, so could you tell me the exact training settings? Could you add the parameters to the results? Thanks!

jzyustc commented on August 23, 2024

Well, you can try our updated training method. The settings for stage 1 are:

{
  "project_name": "MBRS_Diffusion",
  "with_diffusion": true,
  "only_decoder": false,
  "epoch_number": 101,
  "batch_size": 16,
  "train_continue": false,
  "train_continue_path": "/",
  "train_continue_epoch": 0,
  "dataset_path": "YourDatasetPath/",
  "save_images_number": 4,
  "H": 128,
  "W": 128,
  "message_length": 30,
  "lr": 1e-3,
  "noise_layers": [
    "Combined([JpegMask(50),Jpeg(50),Identity(),Crop(0.15,0.15)])"
  ]
}

For the fine-tuning stage, set it like this:

{
  "project_name": "MBRS_Diffusion",
  "with_diffusion": true,
  "only_decoder": false,
  "epoch_number": 51,
  "batch_size": 16,
  "train_continue": true,
  "train_continue_path": "YourStage1Path/",
  "train_continue_epoch": your_epoch,
  "dataset_path": "YourDatasetPath/",
  "save_images_number": 4,
  "H": 128,
  "W": 128,
  "message_length": 30,
  "lr": 1e-4,
  "noise_layers": [
    "Combined([JpegMask(50),Jpeg(50),Identity(),Crop(0.15,0.15)])"
  ]
}
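
Since the two settings differ only in a few fields, one way to keep them in sync is to derive both from a shared base. The sketch below is an illustration, not part of the repository: `BASE`, `make_settings`, and the example epoch value 100 are hypothetical, while the keys and values are copied from the two JSON blocks above.

```python
import json

# Fields shared by both stages, copied from the settings in this reply.
BASE = {
    "project_name": "MBRS_Diffusion",
    "with_diffusion": True,
    "only_decoder": False,
    "batch_size": 16,
    "dataset_path": "YourDatasetPath/",
    "save_images_number": 4,
    "H": 128,
    "W": 128,
    "message_length": 30,
    "noise_layers": [
        "Combined([JpegMask(50),Jpeg(50),Identity(),Crop(0.15,0.15)])"
    ],
}


def make_settings(stage: str, stage1_path: str = "YourStage1Path/",
                  stage1_epoch: int = 0) -> dict:
    """Return the settings dict for 'stage1' or 'finetune'."""
    if stage == "stage1":
        extra = {"epoch_number": 101, "lr": 1e-3, "train_continue": False,
                 "train_continue_path": "/", "train_continue_epoch": 0}
    elif stage == "finetune":
        extra = {"epoch_number": 51, "lr": 1e-4, "train_continue": True,
                 "train_continue_path": stage1_path,
                 "train_continue_epoch": stage1_epoch}
    else:
        raise ValueError(f"unknown stage: {stage!r}")
    return {**BASE, **extra}


# 100 is a placeholder for the stage 1 epoch you choose to resume from.
print(json.dumps(make_settings("finetune", stage1_epoch=100), indent=2))
```

The only differences between the stages are the epoch count, the learning rate (1e-3, then 1e-4), and the three `train_continue*` fields that point fine-tuning at the stage 1 checkpoint.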

Good luck

jzyustc commented on August 23, 2024

I have trained the model with combined noise layers consisting of {JpegTest(50),Jpeg(50),Identity(),Crop(0.15,0.15),GF(2),Cropout(p=0.3),Dropout(p=0.3)}. Are these the same as your combined noise layers? If not, what are the exact combined noise layers?

Just look at my previous reply. I only use Crop and JPEG (real and simulated) for training, without Cropout or Dropout, which matches what I describe in the paper.

I also provided the train_settings.json file in the previous reply. Just try using it.

Please also note that 'JpegTest' is used only in testing, because it uses PIL to save a .jpg image. For training, please use 'JpegMask' or 'JpegSS' for simulated JPEG, and 'Jpeg' for real JPEG.
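
A quick check along these lines can catch a test-only layer before training starts. This is a sketch under assumptions: `TEST_ONLY` and `find_test_only_layers` are hypothetical names, and the only rule encoded is the one stated above, that 'JpegTest' belongs in testing.

```python
import re
from typing import List

# Per the reply above, JpegTest saves a .jpg through PIL and is meant for testing only.
TEST_ONLY = {"JpegTest"}


def find_test_only_layers(noise_layers: List[str]) -> List[str]:
    """Return the test-only layer names that appear in a noise_layers setting."""
    found = []
    for spec in noise_layers:
        # A layer name is an identifier immediately followed by "(".
        for name in re.findall(r"([A-Za-z_]\w*)\s*\(", spec):
            if name in TEST_ONLY and name not in found:
                found.append(name)
    return found


train_layers = ["Combined([JpegMask(50),Jpeg(50),Identity(),Crop(0.15,0.15)])"]
print(find_test_only_layers(train_layers))  # nothing is flagged for this setting
```

Passing a setting that contains `JpegTest(50)` returns `["JpegTest"]`, which a training script could treat as a reason to stop and ask for 'JpegMask', 'JpegSS', or 'Jpeg' instead.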

jzyustc commented on August 23, 2024

Please read my reply and the paper more carefully, thanks.

In the previous reply:
[image]

In the arXiv paper:
[image]
