Comments (11)

AngusDavis avatar AngusDavis commented on May 19, 2024

@nevillelyh - Can you give an estimate of how deep the nested records go?

from hadoop-connectors.

nevillelyh avatar nevillelyh commented on May 19, 2024

4-5 layers, and some of them are Array or Map.
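
For anyone who wants to measure this on their own files, here is a minimal sketch that counts record/array/map layers in an Avro schema. It assumes the schema has already been extracted as JSON (for example with `avro-tools getschema`); the function name `nesting_depth` and the example schema are hypothetical, not taken from the files in this thread.

```python
import json

def nesting_depth(schema):
    """Count nested record/array/map layers in a parsed Avro schema."""
    if isinstance(schema, list):
        # Union: report the deepest branch.
        return max((nesting_depth(branch) for branch in schema), default=0)
    if isinstance(schema, dict):
        kind = schema.get("type")
        if kind == "record":
            fields = schema.get("fields", [])
            return 1 + max((nesting_depth(f["type"]) for f in fields), default=0)
        if kind == "array":
            return 1 + nesting_depth(schema["items"])
        if kind == "map":
            return 1 + nesting_depth(schema["values"])
        if isinstance(kind, (dict, list)):
            # Wrapper such as {"type": {...}}: descend into the inner schema.
            return nesting_depth(kind)
    # Primitive type name or a reference to a named type.
    return 0

# Hypothetical schema: record -> array -> record -> map.
EXAMPLE_SCHEMA = json.loads("""
{
  "type": "record", "name": "Root",
  "fields": [
    {"name": "id", "type": "long"},
    {"name": "items", "type": {
      "type": "array",
      "items": {
        "type": "record", "name": "Item",
        "fields": [
          {"name": "tags", "type": {"type": "map", "values": "string"}}
        ]
      }
    }}
  ]
}
""")

print(nesting_depth(EXAMPLE_SCHEMA))
```

References to named types are plain strings in the schema JSON, so they count as depth 0 here and the walk always terminates.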

nevillelyh avatar nevillelyh commented on May 19, 2024

This also happens with Avro files exported via the BigQuery console. Should I file a bug with the BQ folks?

nevillelyh avatar nevillelyh commented on May 19, 2024

Hi, I'm hitting this issue again with the Dataflow Java SDK, now that it exports BigQuery data as Avro.
Here are some example files that trigger the error:
gs://neville-steel-eu/avro-test/

It might be related to this:
http://stackoverflow.com/questions/24130615/circular-references-not-handled-in-avro

Can anyone look into it?
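
To check the circular-reference theory against a given file, a rough sketch like the following can flag record types that refer back to themselves. It assumes the schema is available as parsed JSON and that type names are simple (no namespace resolution); `find_recursive_types` and the linked-list schema are hypothetical illustrations, not the schema of the files above.

```python
def find_recursive_types(schema, enclosing=()):
    """Return names of records that are referenced from inside their own definition."""
    found = set()
    if isinstance(schema, str):
        # A bare string is a primitive or a reference to a named type.
        if schema in enclosing:
            found.add(schema)
    elif isinstance(schema, list):
        # Union: check every branch.
        for branch in schema:
            found |= find_recursive_types(branch, enclosing)
    elif isinstance(schema, dict):
        kind = schema.get("type")
        if kind == "record":
            inner = enclosing + (schema["name"],)
            for field in schema.get("fields", []):
                found |= find_recursive_types(field["type"], inner)
        elif kind == "array":
            found |= find_recursive_types(schema["items"], enclosing)
        elif kind == "map":
            found |= find_recursive_types(schema["values"], enclosing)
        elif isinstance(kind, (dict, list)):
            found |= find_recursive_types(kind, enclosing)
    return found

# Hypothetical self-referencing schema: a singly linked list node.
LINKED_LIST_SCHEMA = {
    "type": "record",
    "name": "Node",
    "fields": [
        {"name": "value", "type": "int"},
        {"name": "next", "type": ["null", "Node"]},
    ],
}

print(find_recursive_types(LINKED_LIST_SCHEMA))
```

An empty result would suggest the schema has no self-references, at least under the simple-name assumption, and that the cause lies elsewhere.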

AngusDavis avatar AngusDavis commented on May 19, 2024

@nevillelyh - Can you provide a full object name in your avro-test bucket that has public read access? I don't have permission to list objects within that bucket.

AngusDavis avatar AngusDavis commented on May 19, 2024

Although it isn't a stack overflow, I've managed to produce bad Avro files (they fail with parse errors that shouldn't occur). I'm currently filing a bug with BQ, but your test case would still be useful in case these are separate bugs.

dhalperi avatar dhalperi commented on May 19, 2024

@AngusDavis and I are collaborating internally and I can share with him what he needs -- pretty sure these are the same underlying cause at the end of the day.

nevillelyh avatar nevillelyh commented on May 19, 2024

@AngusDavis I've made the Avro file public: gs://neville-steel-eu/avro-debug/000000000000
@dhalperi yes that's my guess too. Thanks for looking into this!

AngusDavis avatar AngusDavis commented on May 19, 2024

@nevillelyh - Thanks, got it.

dhalperi avatar dhalperi commented on May 19, 2024

Hi Neville, I believe the underlying bug in the BigQuery Avro file generator has been fixed. Thanks for the report and the reproduction -- this was crucial to successful resolution!

AngusDavis avatar AngusDavis commented on May 19, 2024

@dhalperi, @nevillelyh - Thanks both. Marking this issue closed.
