blkserene / wordless Goto Github PK
View Code? Open in Web Editor NEWAn Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation
License: GNU General Public License v3.0
An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation
License: GNU General Public License v3.0
:-)
But downloading process can not be finished, where shall i put the netdisk downloaded language model?
这个软件功能强大,解决了语言研究者不会编程之苦!太帅了。请问有没有考虑加入统计中英文文本实词和虚词数量以及计算二者之间比率的功能?需要使用这个来计算文本的信息密度。
可考虑使用sqlite来分页读取和lazy load: https://docs.python.org/3/library/sqlite3.html
下载好了wordless 2.1.0, 解压之后打不开exe文件,没有任何提示. 用的是win10系统
each time when I finish import or almost finish, the software will breakdown, plz check!
Describe the bug
Some chars like '
are displayed as HTML in the Sentence column.
To Reproduce
Steps to reproduce the behavior:
echo -e "What's that? The sign & means \"and\"." > sample.txt
Dependency Parser
tabsample.txt
and click Generate table
Expected behavior
Sentences are shown as they actually are.
Environment information
Additional context
I dug a bit and found that with this line commented, the problem disappears.
(The Concordancer
and Concordancer Parallel
tabs are missing in the screenshots because I disabled them after failing to install some of their dependencies. But the screenshots do belong to Wordless version 3.3.0, though run from source code instead of the compiled release.)
“Wordless” is damaged and can’t be opened.
MacBook Pro (13-inch, 2018, Four Thunderbolt 3 Ports)
MacOS 12.1
请问这种情况咋处理。。
hi
thx for this tool, i tried using the OSX version but program does not want to start;
sorry i can't get any diagnostic info from Console for some reason;
my system is 10.11.6
thx
Hi,
I have a few documents mixed in Tibetan and Chinese. I found Wordless would crash multiple times, especially for n-gram, collocation extractor.
I'm not sure if that's because of the size of the corpus. From the profiler, there are 10909 paragraphs, and 172885 tokens, 581964 characters. I remember I tried with small files, but the app crashed too.
I'm on macOS 12.3.1 with M1 Pro chip. I tried another Macbook with intel chip but had the same experience. Wordless version: 2.2.0.
The path to Wordless doesn't have any non-ASCII characters, though file names are in Tibetan and Chinese.
I do have the crash report that generated by the system, but not sure if that's helpful. Please let me know what other information are needed for investigation.
Thanks
是否可考虑替换成Noto Sans字体? https://leonax.net/p/7750/use-noto-sans-cjk-as-default-blog-font/
Not really an issue, but in your next releases, you might want to use botok instead of pybo.
Everything is the same except for the import line.
All the codebase related to the Tibetan tokenizers has been moved to botok. pybo is now a toolbox that imports and uses botok amongst others and provide a convenient command line interface for standard operations.
Describe the bug
i just want to upload a file in order to run the app but it said it has fatal error and freeze immediately
To Reproduce
Steps to reproduce the behavior:
Expected behavior
please fix it as soon as possible
Environment information
Additional context
Add any other context about the problem here.
用wordless分析奥巴马的演讲A Just and Lasting Peace Nobel Peace Prize Lecture Oslo, Noway, December 10, 2009
出现如下提示:
Data processing has completed successfully, but there are no results to display. You can change your settings and try again.
是因为文章太短了吗?该怎么设置?
笔记本4k高分屏看起来眼睛要瞎了,求大佬做下适配
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.