【Question title】: DOM parser doesn't see subnodes
【Posted】: 2014-02-11 13:39:32
【Question】:

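I am trying to parse a Lingvo XML dictionary with a DOM parser.

Problem: the DOM parser doesn't see the child nodes of the card node (see the code below).

Question: how do I pull the word and translation nodes out of a card node?

My code: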

import entity.Item;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.SAXException;

import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

public class DOMParser {

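    public void parseXMLFile(String xmlFilePath) throws IOException, SAXException, ParserConfigurationException {
        // Create the DocumentBuilder used to parse the classpath resource
        DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
        Document document = builder.parse(ClassLoader.getSystemResourceAsStream(xmlFilePath));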
        List<Item> itemList = new ArrayList<Item>();
        NodeList nodeList = document.getDocumentElement().getChildNodes();
        //iterates through cards
        for (int i = 0; i < nodeList.getLength(); i++) {
            Node node = nodeList.item(i);
            System.out.println(node.getNodeName());
            if (node instanceof Element) {
                if ("card".equals(node.getNodeName())) {
                    // HERE the node seems to have nothing at all: no attributes, no children, etc.
                } 
            }
        }
    }
}

My XML:

<?xml version="1.0" encoding="UTF-16"?>
<dictionary formatVersion="5" title="User ;vocabulary_user1" sourceLanguageId="1058" destinationLanguageId="1033" nextWordId="611" targetNamespace="http://www.abbyy.com/TutorDictionary">
    <statistics readyMeaningsQuantity="90" activeMeaningsQuantity="148" learnedMeaningsQuantity="374" />
    <card>
        <word>загальна цікавість</word>
        <meanings>
            <meaning>
                <statistics status="4" answered="122914" />
                <translations>
                    <word>genaral wondering</word>
                </translations>
            </meaning>
        </meanings>
    </card>
</dictionary>

【Question comments】:

Tags: java xml parsing dom xml-parsing


【Solution 1】:

You can read everything with a recursive method instead of getting tangled in nested for loops.

For your XML:

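import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.SAXException;

import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStream;

// The class name is illustrative; any name works.
public class DomTraversal {

    public static void main(String[] args) throws ParserConfigurationException,
            SAXException, IOException {
        // try-with-resources closes the stream once parsing is done
        try (InputStream path = new FileInputStream("dom.xml")) {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            DocumentBuilder builder = factory.newDocumentBuilder();
            Document document = builder.parse(path);
            traverse(document.getDocumentElement());
        }
    }

    // Visits all descendants first, then prints the node itself if it is a <word> element
    public static void traverse(Node node) {
        NodeList list = node.getChildNodes();
        for (int i = 0; i < list.getLength(); i++) {
            Node currentNode = list.item(i);
            traverse(currentNode);
        }

        if ("word".equals(node.getNodeName())) {
            System.out.println("This -> " + node.getTextContent());
        }
    }
}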

This gives:

This -> загальна цікавість
This -> genaral wondering
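
The recursive traversal above prints every word element without saying which card it belongs to. If the goal is to pair each card's headword with its translations, a more targeted option is Element.getElementsByTagName, which returns only matching descendant elements in document order and so skips the whitespace #text nodes that getChildNodes also returns. The following is a minimal sketch, not a definitive implementation: the class name CardExtractor, the helper names, and the String[] {word, translation} result shape are hypothetical, and it assumes that the headword is the first word element inside each card, as in the sample XML.

```java
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.SAXException;

import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper class; names and result shape are illustrative only.
class CardExtractor {

    // Parses an XML stream into a DOM document with the default (non-namespace-aware) factory.
    static Document parse(InputStream in)
            throws ParserConfigurationException, SAXException, IOException {
        return DocumentBuilderFactory.newInstance().newDocumentBuilder().parse(in);
    }

    // Returns one {headword, translation} pair per translation found under each <card>.
    static List<String[]> extractPairs(Document document) {
        List<String[]> pairs = new ArrayList<>();
        NodeList cards = document.getDocumentElement().getElementsByTagName("card");
        for (int i = 0; i < cards.getLength(); i++) {
            Element card = (Element) cards.item(i);

            // Assumption: the first <word> in document order is the card's headword.
            NodeList allWords = card.getElementsByTagName("word");
            if (allWords.getLength() == 0) {
                continue; // card without any <word>, nothing to report
            }
            String headword = allWords.item(0).getTextContent().trim();

            // Every <word> nested inside a <translations> element is a translation.
            NodeList translationGroups = card.getElementsByTagName("translations");
            for (int j = 0; j < translationGroups.getLength(); j++) {
                Element group = (Element) translationGroups.item(j);
                NodeList translated = group.getElementsByTagName("word");
                for (int k = 0; k < translated.getLength(); k++) {
                    String translation = translated.item(k).getTextContent().trim();
                    pairs.add(new String[] {headword, translation});
                }
            }
        }
        return pairs;
    }

    public static void main(String[] args) throws Exception {
        // "dom.xml" is the file name used in the answer above.
        try (InputStream in = new FileInputStream("dom.xml")) {
            for (String[] pair : extractPairs(parse(in))) {
                System.out.println(pair[0] + " -> " + pair[1]);
            }
        }
    }
}
```

Run against the XML from the question, this should print a single line pairing the headword with its one translation. Because getElementsByTagName searches all descendants, the sketch would need adjusting if a word element could appear in a card before the headword.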

【Comments】:
