Comments (8)
Thanks for the report - I will investigate this issue.
from anglesharp.
Hm I cannot reproduce this issue. Can you confirm that the following selector gets the element which causes the possible problems: document.QuerySelector("img.img-responsive")
?
I pretty much compared AngleSharp's result with the current results delivered by the Blink engine (Opera 28) and both are the same. Am I missing something?
from anglesharp.
Yes, that's the one. It has an attribute named documentary." that includes the period, backslash and quote. I'm using the nuget assembly, version 0.7.0.37131. Is that too old?
from anglesharp.
I updated the assembly and verified that the same problem occurs with 0.8.1 as well.
Incidentally, I'm reading the file with
var html = File.ReadAllText(filename).
And parsing with an empty Configuration
object.
Also: Windows 8.1, .net 4.5.
from anglesharp.
It's supposed to be like that. So this is not an issue, but rather one of HTML's "features". Please compare to the implementations in all HTML5 conforming browsers.
from anglesharp.
Just so I understand, could you explain this a bit more please?
from anglesharp.
Well, what else is there to say. I could write about tokenization in HTML5 documents, but all you need to know is that the given snippet that is responsible for the image element generation contains a dot and a backslash, which are legal characters in an attribute name. Also quotation marks are allowed. However, quotation marks (among some other characters) are parse errors.
These errors will never result in an exception or anything bad, but are indications that the page might not result in the desired tree.
HTML5 contains a lot of "features" that are included to be as backward compatible (not only with HTML, but also with former HTML error correction which was freely implemented by browser vendors, as there was no official error correction) as possible.
from anglesharp.
Understood. Some of my code converts the parsed Html into Xml (i.e. x-html) so I guess I'll have to eliminate tags that are unrecognized.
Great job on compatibility with the standard HTML5 parsing algorithm.
from anglesharp.
Related Issues (20)
- Request for Support / Sponsorship HOT 1
- IHtmlDocument has IDisposable - for what?) HOT 1
- Redirect to Custom URL Scheme HOT 2
- Issues with Headers HOT 2
- Use libraries provided by framework HOT 3
- Provide repo link as part of nuget package HOT 8
- QuerySelectorAll problem HOT 1
- SemVer scheme in AngleSharp -alpha versions broken HOT 1
- Multipart/form-data support HOT 1
- IndexOutOfRangeException in AngleSharp.Common.ArrayPoolBuffer.Append HOT 1
- Getting Attributes for each Element HOT 1
- InvalidOperationException: Stack empty in AngleSharp.Html.Parser.HtmlDomBuilder HOT 1
- Attributes in Elements HOT 3
- Parser Issue Findings from Fuzzing HOT 4
- Additional Findings from Fuzzing HOT 3
- NullReferenceException when using own HttpClient HOT 1
- How to Change the Accept header in DocumentRequest HOT 1
- Link element source not loaded when element is appended to document
- AngleSharp 1.2.0-beta.410 can not be loaded in .Net Framework because of System.Memory version error HOT 3
- obsolete reference HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from anglesharp.