Giter Club home page Giter Club logo

Comments (9)

SuryanarayanaY avatar SuryanarayanaY commented on April 28, 2024 4

Hi @Di-Is ,

I have replicated the reported memory leak with XLA. Attached gist for reference.

from tensorflow.

AshwinAmbal avatar AshwinAmbal commented on April 28, 2024 3

This document from NVIDIA on tweaking environment variables for XLA memory could help. We're yet to test this out though and will try to keep this thread updated on any findings:
https://docs.nvidia.com/deeplearning/frameworks/tensorflow-user-guide/index.html#xla-best-practices

Btw, the formation of clusters / Operation fusing / compilation using XLA increases the memory for each different shape of input that the framework comes across. If we keep the shape constant, we've found that after X amount of time, the memory growth stabilizes even with TF 2.15.

from tensorflow.

Di-Is avatar Di-Is commented on April 28, 2024

@NBCBM
Please do not post low-quality LLM-generated text.

from tensorflow.

Venkat6871 avatar Venkat6871 commented on April 28, 2024

Hi @Di-Is ,
I tried to run your code on Colab using TF v2.16 and faced the same issue. Please find the gist here for reference.

Thank you!

from tensorflow.

Di-Is avatar Di-Is commented on April 28, 2024

@Venkat6871
Thank you for your reply!
I am using tensorflow compiled for Nvidia GPU(tensorflow[and-cuda] 2.16.1).
It seems there is also a difference in the Python version you confirmed.

from tensorflow.

Di-Is avatar Di-Is commented on April 28, 2024

I observed memory leaks in Google Colab with T4 GPU.

from tensorflow.

juanma9613 avatar juanma9613 commented on April 28, 2024

@Venkat6871, do you have any progress on this issue.

I'm also affected because of this issue, I tried will all tf versions from 2.11 up to 2.16 and it seems like this happens since tf 2.12. It seems this was not happening for tf 2.11

thank you

from tensorflow.

sgkouzias avatar sgkouzias commented on April 28, 2024

I face the exact problem. Using Ubuntu 20.04, NVIDIA RTX3060 & python=3.11

@Venkat6871, do you have any progress on this issue?

from tensorflow.

BobbyWilt avatar BobbyWilt commented on April 28, 2024

I'm also encountering the exact same problem using Ubuntu 20.04 NVIDIA GTX1070 and python 3.10. Would be great to get this fixed since the memory leakage can get pretty astronomical if left unchecked. I've had it grow up to 12gB until it crashed my kernel.

from tensorflow.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.