Comments (7)
Some more error message context:
2024-04-05 12:58:20,105 1607 ERROR _handle_rpc_error GRPC Error received
Traceback (most recent call last):
File "/databricks/spark/python/pyspark/sql/connect/client/core.py", line 1485, in _execute_and_fetch_as_iterator
for b in generator:
File "/usr/lib/python3.10/_collections_abc.py", line 330, in __next__
return self.send(None)
File "/databricks/spark/python/pyspark/sql/connect/client/reattach.py", line 133, in send
if not self._has_next():
File "/databricks/spark/python/pyspark/sql/connect/client/reattach.py", line 194, in _has_next
raise e
File "/databricks/spark/python/pyspark/sql/connect/client/reattach.py", line 166, in _has_next
self._current = self._call_iter(
File "/databricks/spark/python/pyspark/sql/connect/client/reattach.py", line 280, in _call_iter
raise e
File "/databricks/spark/python/pyspark/sql/connect/client/reattach.py", line 263, in _call_iter
return iter_fun()
File "/databricks/spark/python/pyspark/sql/connect/client/reattach.py", line 167, in <lambda>
lambda: next(self._iterator) # type: ignore[arg-type]
File "/databricks/python/lib/python3.10/site-packages/grpc/_channel.py", line 426, in __next__
return self._next()
File "/databricks/python/lib/python3.10/site-packages/grpc/_channel.py", line 826, in _next
raise self
grpc._channel._MultiThreadedRendezvous: <_MultiThreadedRendezvous of RPC that terminated with:
status = StatusCode.INTERNAL
details = "[INSUFFICIENT_PERMISSIONS] Insufficient privileges:
User does not have permission SELECT on any file. SQLSTATE: 42501"
debug_error_string = "UNKNOWN:Error received from peer unix:/databricks/sparkconnect/grpc.sock {grpc_message:"[INSUFFICIENT_PERMISSIONS] Insufficient privileges:\nUser does not have permission SELECT on any file. SQLSTATE: 42501", grpc_status:13, created_time:"2024-04-05T12:58:20.104583977+00:00"}"
from cobrix.
Do you read the copybook and the data file via the RDD API? If so, this is the likely cause, as the RDD API is not supported by DataBricks in the Unity Catalog: https://learn.microsoft.com/en-us/azure/databricks/compute/access-mode-limitations#spark-api-limitations-for-unity-catalog-shared-access-mode
from cobrix.
@schwalldorf , Thanks for the interest in the project. Very glad you like it!
- We use Hadoop client directly to load the copybook ( )
- We use RDD for variable length files for
- building indexes ( )
- for reading data files ( )
What is the Databrics-supported alternative for reading data files concurrently from Spark?
from cobrix.
Hi Ruslan,
thanks a lot for your reply.
DataBricks supports both the DataFrame API and the Dataset API. I think the Dataset API should be closer to RDDs, but I'm not an expert in this. And I wouldn't know how to easily rewrite your code.
from cobrix.
Sure. Let's keep this issue open. This is something we might look at at some point. In the meantime somebody might suggest a workaround.
from cobrix.
Hi there,
I am also encountering this issue described in #665. I'm looking forward to any updates or workarounds that might be available. Following this for any progress.
Thanks!
from cobrix.
So far no progress on this since I don't have access to a Databricks instance at the moment. But this might change during the year, will keep in mind to fix it
from cobrix.
Related Issues (20)
- Can I get the raw record bytes from ebcdic file w/out parsing HOT 4
- BBBB in copybook HOT 3
- Is it possible to read a nested Binary Field? HOT 1
- Record length option is ignored when generate record id is turued on
- Add CI/CD for automatic releases
- Reading EBCDIC file with multiple structure HOT 1
- Reading Variable Length File with OCCCURS DEPENDING HOT 12
- NoClassDefFoundError: Could not initialize class za.co.absa.cobrix.cobol.parser.decoders.FloatingPointDecoders$ HOT 3
- Not able to parse the content correctly when copybook has OCCURS X TIMES DEPENDING ON FIELD_NAME HOT 3
- Support for decimal scaling PV HOT 6
- Can't read multiple main headers defined in single copybook HOT 4
- Add support for parsing copybooks given Spark options
- Missing SIgn for few fileds that are negative HOT 5
- How to read a pipe separated file with Cobrix HOT 3
- PIC S9(10)V USAGE COMP-3 is converted to long instead of Decimal(10,0) HOT 4
- comp-3 values parsing issues HOT 2
- Shade ANTLR runtime in the parser to avoid ANTLR potential incompatibility issues
- Under some circumstances Cobrix selects wrong record reader failing the Spark job
- Add a feature to collapse structs or the output data
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cobrix.