HtmlFile
HtmlFile对象是指通过Excel VBA代码自动化Internet Explorer浏览器并打开Web页面后,将Web页面的HTML源代码下载到Excel VBA中生成的对象。该对象可以方便地进行搜索、解析和操作Web页面数据,以实现在Excel中对Web数据进行编辑、分析和处理的目标。
以下是三个基本的代码实例,展示了如何使用HtmlFile对象:
1、获取Web页面内容:
Dim ie As Object
Dim htDoc As Object
Set ie = CreateObject("InternetExplorer.Application")
ie.Visible = True
ie.Navigate "https://www.example.com"
Do While ie.Busy Or ie.readyState <> 4
DoEvents
Loop
Set htDoc = ie.document
Dim htmlText As String
htmlText = htDoc.body.innerHTML
MsgBox ("Web页面内容:" + htmlText)
ie.Quit
Set ie = Nothing
2、搜索Web页面元素:
Dim ie As Object
Dim htDoc As Object
Set ie = CreateObject("InternetExplorer.Application")
ie.Visible = True
ie.Navigate "https://www.example.com"
Do While ie.Busy Or ie.readyState <> 4
DoEvents
Loop
Set htDoc = ie.document
Dim searchResult As Object
Set searchResult = htDoc.getElementsByTagName("div")
For Each ele In searchResult
If ele.className = "classA" Then
MsgBox ("找到classA元素")
Exit For
End If
Next ele
ie.Quit
Set ie = Nothing
3、以表格形式显示Web页面数据:
Dim ie As Object
Dim htDoc As Object
Set ie = CreateObject("InternetExplorer.Application")
ie.Visible = True
ie.Navigate "https://www.example.com"
Do While ie.Busy Or ie.readyState <> 4
DoEvents
Loop
Set htDoc = ie.document
Dim tableData As Object
Set tableData = htDoc.getElementById("tableId")
Dim htmlTable As Object
Set htmlTable = tableData.getElementsByTagName("table")(0) '假设只有一个表格
Dim tblRow As Object
Dim tblCell As Object
Dim rowIndex As Integer
Dim colIndex As Integer
rowIndex = 1 '初始化表格起始行
colIndex = 1 '初始化表格起始列
For Each tblRow In htmlTable.Rows
For Each tblCell In tblRow.Cells
ActiveSheet.Cells(rowIndex, colIndex).Value = tblCell.innerText
colIndex = colIndex + 1
Next tblCell
rowIndex = rowIndex + 1
colIndex = 1 '恢复表格列计数器
Next tblRow
ie.Quit
Set ie = Nothing
HtmlProjectItems
它可以代表VBA项目中的 HTML元素或文件。这个对象是从VBProject对象的WebComponents集合中得到的。通过HtmlProjectItems对象,开发人员可以编程控制HTML元素和文件,以及读取或写入其中的内容。例如,可以通过遍历HtmlProjectItems集合来获取所有HTML文件的列表,并逐个打开或关闭这些文件;还可以通过HtmlProjectItems对象访问HTML输入框、按钮等控件的属性和方法,来实现与用户交互的功能。
以下是3个使用HtmlProjectItems对象的Excel VBA代码示例:
1、获取所有HTML文件名并逐一打开
Dim wb As Workbook
Dim proj As VBIDE.VBProject
Dim htm As Object
Set wb = ThisWorkbook
Set proj = wb.VBProject
For Each htm In proj.VBComponents("Sheet1").Designer.HTMLProject.HTMLProjectItems
If htm.Type = vbext_ct_HtmlPage Then
Workbooks.Open htm.SourcePath & "\" & htm.Name
End If
Next htm
2、读取HTML页面某个元素的属性值
Dim wb As Workbook
Dim proj As VBIDE.VBProject
Dim htm As Object
Dim element As Object
Set wb = ThisWorkbook
Set proj = wb.VBProject
Set htm = proj.VBComponents("Sheet1").Designer.HTMLProject.HTMLProjectItems("example.html")
Set element = htm.webdocument.getElementById("example-element")
MsgBox element.getAttribute("class")
3、修改HTML页面中的文本
Dim wb As Workbook
Dim proj As VBIDE.VBProject
Dim htm As Object
Dim element As Object
Set wb = ThisWorkbook
Set proj = wb.VBProject
Set htm = proj.VBComponents("Sheet1").Designer.HTMLProject.HTMLProjectItems("example.html")
Set element = htm.webdocument.getElementById("example-element")
element.innerText = "New Text Value"
在使用HtmlFile和HtmlProjectItems对象时,需要注意以下几点:
1、HtmlFile和HtmlProjectItems对象只能用于Excel中的宏代码,不能在其它Office应用程序或VB6中使用。
2、使用这两个对象需要先添加"Microsoft Internet Controls"引用。在VBA编辑器下,选择菜单栏中的"工具"->"引用",勾选“Microsoft Internet Controls”即可。
3、HtmlFile对象代表单个HTML文件,而HtmlProjectItems对象代表整个HTML项目,包括所有的HTML文件。
4、当使用HtmlProjectItems对象操作整个HTML项目时,需确认项目中是否包含HTML文件,否则会出现“Subscript out of range”等异常错误。
5、可以使用HtmlFile对象的Open和Close方法打开或关闭单个HTML文件,但是要避免同时修改多个HTML文件。
