Comments (1)
I finally solved this problem. The method described at https://github.com/sxzhang1993/Run-cutlass-with-gpgpu-sim uses CUDA 9.1, where the generated .loc directives only use the first syntax, never the second. However, CUDA 9.1 does not support the Turing architecture. To target Turing you can use CUDA 11, but then the problem described above occurs.

It turns out that .loc is debugging information. cutlass_bench is compiled with the -lineinfo option, and without that option no .loc directives are generated. So we can comment out the -lineinfo option in cutlass_bench/CMakeLists.txt, and the final generated PTX will not contain any .loc directives.

Note that GPGPU-Sim 4.0 will then fail with the error mentioned in #247, so GPGPU-Sim 4.2 is needed.
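As a rough way to confirm the workaround, here is a minimal Python sketch that strips the -lineinfo token from CMake text and counts the .loc directives left in a PTX dump. The helper names, the sample flag line, and the sample PTX are all hypothetical; the actual contents of cutlass_bench/CMakeLists.txt are not shown here, so adapt the pattern to however the flag is written in your copy. Removing the token has the same effect on the build as commenting it out.

```python
import re

# A PTX line whose first token is the .loc debugging directive.
LOC_RE = re.compile(r"^[ \t]*\.loc\s", re.MULTILINE)

# A standalone -lineinfo token, together with the whitespace before it.
# The lookbehind avoids touching longer options that merely end in "-lineinfo".
LINEINFO_RE = re.compile(r"[ \t]*(?<![\w-])-lineinfo\b")


def count_loc_directives(ptx_text: str) -> int:
    """Return how many lines of the PTX text start with a .loc directive."""
    return len(LOC_RE.findall(ptx_text))


def strip_lineinfo(cmake_text: str) -> str:
    """Return the CMake text with every standalone -lineinfo token removed."""
    return LINEINFO_RE.sub("", cmake_text)


# Hypothetical flag line; the real CMakeLists.txt may set the flag differently.
sample_cmake = "set(CUDA_NVCC_FLAGS ${CUDA_NVCC_FLAGS} -lineinfo -std=c++11)"

# Hypothetical PTX fragment, as it might look when built with -lineinfo.
sample_ptx = """
.visible .entry kernel()
{
    .loc 1 10 5
    ret;
}
"""

if __name__ == "__main__":
    print(strip_lineinfo(sample_cmake))
    print(count_loc_directives(sample_ptx))
```

After rebuilding without the flag, running count_loc_directives over the generated PTX file should report zero if the workaround took effect.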
from gpgpu-sim_distribution.