vb6 - How to strip all HTML tags and entities and get CLEAR text in Visual Basic? -


how can 1 using visual basic (in case 6), strip html tags , plain text? able accomplish html purifier, in php. there function or class or script in vb6 lets me this, need process pages on 5mbs , it's not efficient in php.

so, again, how convert this:

<!doctype html> <html> <head> <title>title</title> </head> <body> <p>paragraph 1</p> <div>section</div> hello! </body> </html> 

to, let's this:

paragraph 1 section hello! 

i wanted make api system this, found out it's not going reliable.

p.s.: doing making crawler search engine, , have experience in vb , php.

thanks in advanced.

i have snipped c# ... can port vb easy :)

/// <summary> /// remove html string regex. /// </summary> public static string striptagsregex(string source) {             return regex.replace(source, "<.*?>", string.empty); } 

Comments