Comments (10)

tagomoris commented on May 24, 2024

Many WebHDFS request failures produce an HTTP error response, which is then raised as a Fluentd error.
Can you post the following here?

  • your Hadoop hdfs-site.xml configuration (with sensitive parts masked, of course)
  • the output of a curl -v WebHDFS operation run without a username

from fluent-plugin-webhdfs.

LanceNorskog commented on May 24, 2024

Ah, forgot to mention that. When I run curl, it accepts the 307 redirect (to a datanode) from Hadoop, then stops. I don't know whether it actually sends the request to the datanode. When I send the redirect URL to the datanode, that works.

I do not have access to the webhdfs system right now.

This is Hadoop version 1.0.3.

LanceNorskog commented on May 24, 2024

It started working when I added the username parameter. Is there a bug in propagating this error?
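
For readers following along, here is a minimal Ruby sketch of the two-step WebHDFS append discussed above: the namenode answers with a 307 redirect, and the data is then sent to the datanode URL it names. It uses only Ruby's standard library rather than the webhdfs gem. The helper names (webhdfs_uri, webhdfs_append) and the host name in the usage note are hypothetical, and whether a given cluster rejects requests that omit user.name is an assumption that depends on its configuration.

```ruby
require 'net/http'
require 'uri'

# Build a WebHDFS REST URL. `user.name` is the query parameter WebHDFS
# reads the caller's identity from when security is off; it is omitted
# here when no username is given.
def webhdfs_uri(host, port, path, op, username = nil)
  query = { 'op' => op }
  query['user.name'] = username if username
  URI::HTTP.build(host: host, port: port,
                  path: "/webhdfs/v1#{path}",
                  query: URI.encode_www_form(query))
end

# Step 1: POST to the namenode without a body; expect 307 plus a Location header.
# Step 2: POST the data to the datanode URL taken from that header.
def webhdfs_append(host, port, path, data, username = nil)
  uri = webhdfs_uri(host, port, path, 'APPEND', username)
  first = Net::HTTP.start(uri.host, uri.port) do |http|
    http.request(Net::HTTP::Post.new(uri.request_uri))
  end
  raise "expected 307 redirect, got #{first.code}" unless first.code == '307'

  datanode = URI.parse(first['location'])
  req = Net::HTTP::Post.new(datanode.request_uri)
  req['Content-Type'] = 'application/octet-stream'
  req.body = data
  Net::HTTP.start(datanode.host, datanode.port) { |http| http.request(req) }
end
```

Calling webhdfs_append('your.namenode.hostname', 50070, '/tmp/test.txt', "line\n", 'hdfsuser') would attempt the append as hdfsuser. Leaving out the last argument sends the request with no user.name at all, which is the case that failed here.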

tagomoris commented on May 24, 2024

Hmm, I cannot understand why curl succeeds without a username while fluent-plugin-webhdfs fails without one.
Can you try this operation in irb from your client?

require 'webhdfs'
client = WebHDFS::Client.new('your.namenode.hostname', 50070)
client.list('/')
client.append('/tmp/test_file_webhdfs_client.txt', 'aaaaaaaaaaaaaaaaaaaaaaaa')

LanceNorskog commented on May 24, 2024

I'm on CentOS with Ruby 0.9.5. 'gem install webhdfs' fails with a buffer overflow. I don't know how to get the system into the state your test needs.

tagomoris commented on May 24, 2024

What are you running? Ruby '0.9.5' may not exist in this world...

$ which ruby
$ ruby -v
$ which gem
$ gem -v

satousan commented on May 24, 2024

Hello.
May I ask a question about this module?

I have started using this module,
but I cannot see my data being appended to the HDFS directory.
Only when I restart td-agent is some of the data in the td-agent buffer correctly written to the HDFS directory.
While td-agent keeps running, it writes nothing to the HDFS directory (new data is continuously arriving at this server).
So appending does not seem to be working correctly.
Do you have any idea how to make it work correctly?

Below is my td-agent.conf

machine A (data forwarder)

type forward
host machineB

machine B (receiver and writer to HDFS)

type forward
port 24224

type webhdfs
host hdfshostname
port 50070
path /tmp/tmp.log
flash_interval 1s
username hdfsuser

and the following are all set to true across the HDFS cluster:
-dfs.webhdfs.enabled
-dfs.support.append
-dfs.support.broken.append

tagomoris commented on May 24, 2024

@satousan:
Specify flush_interval 1s instead of flash_interval (and please open a new issue for anything further).

@LanceNorskog ping?

satousan commented on May 24, 2024

Thanks for your help!

tagomoris commented on May 24, 2024

@LanceNorskog: please re-open this issue if you still have any problems.
