This repo contains code to convert Structured Documents to Graphs and implement a Graph Convolution Neural Network for node classification

Python 100.00%

computer-vision deep-learning geometric-deep-learning graph-convolution

graph-convolution-on-structured-documents's People

Contributors

Stargazers

Watchers

graph-convolution-on-structured-documents's Issues

How to extract actual data for entity

Hi,

Amazing work to understand structure documents. Is it possible to extract actual value for given entity. For example invoice #, total amount, company name, etc...

Thank you

graph_dict object wont preserve both neighbours of node (right side and below)

See here

graph_dict = {}
for src_id, row in df.iterrows():
    if row['below_obj_index'] != -1:
         graph_dict[src_id] = [row['below_obj_index']]
    if row['side_obj_index'] != -1:
	 graph_dict[src_id] = [row['side_obj_index']]

What's the next step after generating the graph image and the connections.csv

Hello, first of all thank you for your valuable work, i wanted to ask you on how should i proceed after generating the graph png image and the connections.csv file, how do i feed that to the Graph Convolutional Neural Network. Thanks in a dvance.

"Passing list-likes to .loc or [] with any missing labels is no longer supported. "

	df['below_object'] = df.loc[nearest_dest_ids_vert, 'Object'].values

Because of this line I am getting this error. Any idea for avoiding this error ?

AttributeError: 'Adjacency' object has no attribute 'w0'

Hello ! I.m very interesting in your research. Nice work !
when I run model.compile(), the flow arror generate:
assert adj_list[0].shape[0] == self.w0.shape[0], f'The number of rows
AttributeError: 'Adjacency' object has no attribute 'w0'

Model training and data extraction

Hello,
After formation of adjacency and feature matrix, we are looking to train our model but for that, how can we include the labels and how will it give the data extraction.
Kindly explain this query.
Thanks in advance.!

Error in dimension of Adjacency matrix without padding

@dhavalpotdar
In some of the images, the dimension of the adjacency matrix is not matching with the no of words in the image. For example -
Suppose the image has 91 words then the adjacency matrix was of shape (90, 90) instead of (91, 91).
One word/node is cut off from the adjacency matrix.

grapher.py running problem

I try to run the code grapher.py, and the next line

		# ==================== vertical ===================================== #
		# create df for plotting lines
		df['below_object'] = df.loc[nearest_dest_ids_vert, 'Object'].values

gives the next error

KeyError: "Passing list-likes to .loc or [] with any missing labels is no longer supported. The following labels were missing: Int64Index([-1, -1, -1, -1, -1, -1], dtype='int64'). See https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#deprecate-loc-reindex-listlike"

What is the solution?

AssertionError: Expected type <class 'str'>. Received <class 'float'> & IndexError: index 22 is out of bounds for axis 1 with size 22

the code A, X = graph.make_graph_data(graph_dict, text_list) is throwing an assertion error
Expected type <class 'str'>. Received <class 'float'>:

graph_dict appears to be having float values: {0: [4.0], 4: [5.0], 5: [1.0]}

After debugging all the open issues error this appears to be the last one occuring at the last line of the code. Will appreciate anyhelp to run that Grapher.py successfully.

dhavalpotdar / graph-convolution-on-structured-documents Goto Github PK

graph-convolution-on-structured-documents's People

Contributors

Stargazers

Watchers

Forkers

graph-convolution-on-structured-documents's Issues

Recommend Projects

Recommend Topics

Recommend Org