Comments (6)
As suggested in my pull request, I believe the best approach is to use Python's built-in tempfile module: it has no extra dependencies, is well tested, and is cross-platform.
import tempfile
cache_dir = tempfile.gettempdir()
Do you see this replacing the home directory fallback, or as a second layer of fallback?
from urlextract.
For my use case it's OK to have it as the second option, replacing home. Maybe it's a better place than the user's home directory. It's not that persistent (as you mentioned), but that is fine by me.
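The fallback order discussed above could be sketched roughly as follows. This is only an illustration: `choose_cache_dir` and its `preferred_dir` argument are hypothetical names, not part of urlextract, and the sketch assumes the preferred location is tried first with the system temp directory as the second option.

```python
import os
import tempfile


def choose_cache_dir(preferred_dir):
    """Return preferred_dir if it is an existing, writable directory.

    Otherwise fall back to the system temp directory.
    Hypothetical helper for illustration; not urlextract's actual code.
    """
    if os.path.isdir(preferred_dir) and os.access(preferred_dir, os.W_OK):
        return preferred_dir
    # Second option: the platform temp directory. Files here may be
    # cleaned up by the OS, so the cache is not guaranteed to persist.
    return tempfile.gettempdir()
```

Because the temp directory can be purged, a cache file stored there may need to be downloaded again after a reboot or cleanup.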
Why not keep the files in a cache directory, following the XDG Base Directory Specification? https://standards.freedesktop.org/basedir-spec/basedir-spec-latest.html
There are a few packages that provide cross-platform-ish XDG support:
https://pypi.org/search/?q=xdg
The most commonly used is https://pypi.org/project/appdirs/
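For reference, the cache lookup that the spec describes can be sketched with the standard library alone. The function name `xdg_cache_dir` and the `environ` parameter are assumptions made for this example. Unlike a package such as appdirs, this sketch covers only the XDG rules, with no Windows or macOS conventions.

```python
import os


def xdg_cache_dir(app_name, environ=None):
    """Resolve a per-application cache directory following the XDG rules.

    Hypothetical helper for illustration. It does not create the directory.
    """
    environ = os.environ if environ is None else environ
    base = environ.get("XDG_CACHE_HOME", "")
    # The spec says an unset or empty value falls back to $HOME/.cache,
    # and that relative paths are invalid and should be ignored.
    if not base or not os.path.isabs(base):
        base = os.path.join(os.path.expanduser("~"), ".cache")
    return os.path.join(base, app_name)
```

Calling `xdg_cache_dir("urlextract")` on a typical Linux account with no `XDG_CACHE_HOME` set would resolve to a `.cache/urlextract` directory under the user's home.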
Thank you for pointing it out.
I used appdirs for getting the user's cache dir.
I am reopening this old ticket because there is a permission error in this implementation.
This line throws a permission error when the process lacks sufficient privileges (tested on Linux):
https://github.com/lipoja/URLExtract/blob/master/urlextract/cachefile.py#L113
Write permission should be checked before creating the cache directory.
Thanks!
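One way to check permissions before creating the directory is sketched below. This is an assumption about how such a check might look, not the code at the linked line and not necessarily the fix that was later released. `ensure_writable_dir` is a hypothetical name.

```python
import os


def ensure_writable_dir(path):
    """Create path if needed and report whether it is a writable directory.

    Hypothetical helper for illustration. Returns False instead of raising
    when the directory cannot be created or written to.
    """
    path = os.path.abspath(path)
    # Walk up to the nearest ancestor that already exists.
    existing = path
    while not os.path.exists(existing):
        parent = os.path.dirname(existing)
        if parent == existing:
            return False
        existing = parent
    # Creating entries requires write and search permission on that ancestor.
    if not os.path.isdir(existing) or not os.access(existing, os.W_OK | os.X_OK):
        return False
    try:
        os.makedirs(path, exist_ok=True)
    except OSError:
        # The pre-check can race with other processes, so still guard the call.
        return False
    return os.access(path, os.W_OK | os.X_OK)
```

A caller could try the user cache directory first and move on to another location, such as the temp directory, whenever this returns False.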
This reopened issue should be fixed in v0.14.0.