Java: How to unescape HTML character entities in Java? -
basically decode given html document, , replace special chars, such " "
-> " "
, ">"
-> ">"
.
in .net can make use of httputility.htmldecode
.
what's equivalent function in java?
i have used apache commons stringescapeutils.unescapehtml4() this:
unescapes string containing entity escapes string containing actual unicode characters corresponding escapes. supports html 4.0 entities.
Comments
Post a Comment