I have found a solution to this problem. QWebFrame has a function called hitTestContent(const QPoint &pos). Given a point, it returns a QWebHitTestResult, which describes the content at that position. If there is an image at that location, the result's pixmap() method returns the QPixmap of that image.

If you have any further questions about this, please do ask and I will explain further.