[PyKDE] How do you get HTML source from konqueror/KHTMLPart?
yichun.wei at gmail.com
Thu Dec 21 01:40:03 GMT 2006
On 12/20/06, Marcos Dione <mdione at grulic.org.ar> wrote:
> On Wed, Dec 20, 2006 at 10:59:06AM -0800, yichun wei wrote:
> > I am trying to grab some html pages via KHTMLPart.openURL and scrape
> > the content I get. However I am not able to read out the HTML document
> > sources I have in KHTMLPart.
> just call:
> domDocu= part.document ()
> html= domDocu.toString ().string ()
> that's a QString.
> > toHTML() seemed to return nothing (None or ""), while toString() gave
> > me an exception and my script crashed:
> yes, under certain circumstances that happens. I think it's because
> the KHTMLPart has no parentWidet or no parent or both. if you setup the
> whole apparatus for showing the part, everythings works just fine.
Thanks a lot Marcos. I was using Jim Bublitz's
doc/examples/pyKHTMLPart.py and modified it from there. It appears to
me that parentWidget for the KHTMLPart is not 0:
def __init__(self, *args):
self.w = KHTMLPart(self, "HTMLart", self)
domDocu = self.w.document ()
html = domDocu.toString().string()
Then I got the error message:
terminate called after throwing an instance of 'DOM::DOMException'
where .toString() is called. Is there any other reason leading to this
other than the 0 values when initiate the KHTMLPart object?
More information about the PyQt