How do I replace part of a string in an XML file?

Asked: 2016-10-04 04:09:45

Tags: java xml parsing dom xpath

I have an XML file that looks like this:

<Verbiage>
        The whiskers plots are based on the responses of incarcerated 
        <Choice>
            <Juvenile> juveniles who have committed sexual offenses. </Juvenile>
            <Adult> adult sexual offenders. </Adult>
        </Choice> 
        If the respondent is a 
        <Choice>
            <Adult>convicted sexual offender, </Adult>
            <Juvenile>juvenile who has sexually offended, </Juvenile>
        </Choice> 
        #his/her_lc# percentile score, which defines #his/her_lc# position 
        relative to other such offenders, should be taken into account as well as #his/her_lc# T score. Percentile 
        scores in the top decile (> 90 %ile) of such offenders suggest that the respondent 
        may be defensive and #his/her_lc# report should be interpreted with this in mind.
    </Verbiage>

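I am trying to find a way to parse the XML file (I have been using DOM), search for #his/her_lc#, and replace it with "her". I tried FileReader, BufferedReader, String.replaceAll and FileWriter, but none of that worked.

Is there a way to do this with XPath?

Ultimately, I want to search this XML file for this string and replace it with another string.

Do I have to wrap the string in a tag for it to be parsed that way?

The code I tried: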

protected void parse() throws ElementNotValidException {
    try {
        //Parse xml File
        File inputXML = new File("template.xml");
        DocumentBuilderFactory parser = DocumentBuilderFactory.newInstance(); // new instance of doc builder
        DocumentBuilder dParser = parser.newDocumentBuilder(); // calls it
        Document doc = dParser.parse(inputXML); // parses file

        FileReader reader = new FileReader(inputXML);
        String search = "#his/her_lc#";
        String newString;

        BufferedReader br = new BufferedReader(reader);
        while ((newString = br.readLine()) != null){
            newString.replaceAll(search, "her");
        }

        FileWriter writer = new FileWriter(inputXML);
        writer.write(newString);
        writer.close();

    } catch (ParserConfigurationException e) {
        e.printStackTrace();
    } catch (IOException e) {
        e.printStackTrace();
    } catch (SAXException e) {
        e.printStackTrace();
    }

The code I settled on:

try {
        File inputXML = new File("template.xml"); // creates new input file
        DocumentBuilderFactory parser = DocumentBuilderFactory.newInstance(); // new instance of doc builder
        DocumentBuilder dParser = parser.newDocumentBuilder(); // calls it
        Document doc = dParser.parse(inputXML); // parses file
        doc.getDocumentElement().normalize();

        NodeList pList = doc.getElementsByTagName("Verbiage"); // gets element by tag name and places into list to begin parsing

        int gender = 1; // gender has to be taken from the response file, it is hard coded for testing purposes
        System.out.println("----------------------------"); // new line

        // loops through the list of Verbiage tags
        for (int temp = 0; temp < pList.getLength(); temp++) {
            Node pNode = pList.item(temp); // current Verbiage node

            if (pNode.getNodeType() == Node.ELEMENT_NODE) { // if the node type = the element node
                Element eElement = (Element) pNode;
                NodeList pronounList = doc.getElementsByTagName("pronoun"); // gets a list of pronoun element tags

                if (gender == 0) { // if the gender is male

                    int count1 = 0;
                    while (count1 < pronounList.getLength()) {

                        if ("#he/she_lc#".equals(pronounList.item(count1).getTextContent())) {
                            pronounList.item(count1).setTextContent("he");
                        }

                        if ("#he/she_caps#".equals(pronounList.item(count1).getTextContent())) {
                            pronounList.item(count1).setTextContent("He");
                        }

                        if ("#his/her_lc#".equals(pronounList.item(count1).getTextContent())) {
                            pronounList.item(count1).setTextContent("his");
                        }
                        if ("#his/her_caps#".equals(pronounList.item(count1).getTextContent())) {
                            pronounList.item(count1).setTextContent("His");
                        }

                        if ("#him/her_lc#".equals(pronounList.item(count1).getTextContent())) {
                            pronounList.item(count1).setTextContent("him");
                        }
                        count1++;
                    }
                    pNode.getNextSibling();

                } else if (gender == 1) { // female
                    int count = 0;
                    while (count < pronounList.getLength()) {

                        if ("#he/she_lc#".equals(pronounList.item(count).getTextContent())) {
                            pronounList.item(count).setTextContent("she");
                        }

                        if ("#he/she_caps#".equals(pronounList.item(count).getTextContent())) {
                            pronounList.item(count).setTextContent("She");
                        }

                        if ("#his/her_lc#".equals(pronounList.item(count).getTextContent())) {
                            pronounList.item(count).setTextContent("her");
                        }
                        if ("#his/her_caps#".equals(pronounList.item(count).getTextContent())) {
                            pronounList.item(count).setTextContent("Her");
                        }

                        if ("#him/her_lc#".equals(pronounList.item(count).getTextContent())) {
                            pronounList.item(count).setTextContent("her");
                        }
                        count++;
                    }
                    pNode.getNextSibling();
                }
            }
        }
        // write the content to file
        TransformerFactory transformerFactory = TransformerFactory.newInstance();
        Transformer transformer = transformerFactory.newTransformer();
        DOMSource source = new DOMSource(doc);

        System.out.println("-----------Modified File-----------");
        StreamResult consoleResult = new StreamResult(System.out);
        transformer.transform(source, new StreamResult(new FileOutputStream("template.xml"))); // writes changes to file
    } catch (Exception e) {
        e.printStackTrace();
    }

}

If I can figure out how to associate the pronoun tag with the pronounParser that this code lives in, I think this code will work.

2 Answers:

Answer 0 (score: 2)

I tried this example with your template.xml, and I think it works.

public static void main(String[] args) {

        File inputXML = new File("template.xml");
        BufferedReader br = null;
        String newString = "";
        StringBuilder strTotale = new StringBuilder();
        try {

        FileReader reader = new FileReader(inputXML);
        String search = "#his/her_lc#";


        br = new BufferedReader(reader);
        while ((newString = br.readLine()) != null){
            newString = newString.replaceAll(search, "her");
            // readLine() strips the line terminator, so add it back
            strTotale.append(newString).append(System.lineSeparator());
        }

        } catch ( IOException  e) {
            // TODO Auto-generated catch block
            e.printStackTrace();
        } // calls it
        finally
        {
            try {
                if (br != null) { // br is still null if the file could not be opened
                    br.close();
                }
            } catch (IOException e) {
                // TODO Auto-generated catch block
                e.printStackTrace();
            }
        }


        System.out.println(strTotale.toString());


    }

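First, you have to assign the result of replaceAll back to the variable, because the call returns a new string and leaves the original unchanged: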

newString = newString.replaceAll(search, "her");

Second, I used a StringBuilder to collect all the lines.
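The two points above can be shown in isolation. This is only a minimal sketch: the class name ReplaceDemo and the method replaceInLines are made up for illustration, and it uses String.replace (literal matching) instead of replaceAll (regex matching), which behaves the same for this particular placeholder.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical demo class; not part of the original question or answer.
public class ReplaceDemo {

    // Replaces every literal occurrence of `search` in each line and joins
    // the lines back together, one '\n' after each line.
    static String replaceInLines(List<String> lines, String search, String replacement) {
        StringBuilder out = new StringBuilder();
        for (String line : lines) {
            // String is immutable: replace() returns a NEW string,
            // so the result has to be captured in a variable.
            String updated = line.replace(search, replacement);
            out.append(updated).append('\n');
        }
        return out.toString();
    }

    public static void main(String[] args) {
        List<String> lines = Arrays.asList(
                "#his/her_lc# percentile score,",
                "as well as #his/her_lc# T score.");
        System.out.print(replaceInLines(lines, "#his/her_lc#", "her"));
    }
}
```

Calling line.replace(...) without using the return value, as in the question's loop, compiles but has no visible effect.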

I hope this helps.

Answer 1 (score: 0)

Because strings are immutable, you cannot modify them in place; use a StringBuilder/StringBuffer instead of a String.

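    FileReader reader = new FileReader(inputXML);
    String search = "#his/her_lc#";
    String newString;
    StringBuilder str = new StringBuilder(); // must be initialized before use

    BufferedReader br = new BufferedReader(reader);
    while ((newString = br.readLine()) != null){
        // readLine() strips the line terminator, so add it back
        str.append(newString.replaceAll(search, "her")).append(System.lineSeparator());
    }
    br.close(); // finish reading before the same file is opened for writing

    FileWriter writer = new FileWriter(inputXML);
    writer.write(str.toString()); // write() accepts a String, not a StringBuilder
    writer.close();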
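
The question also asks whether XPath can be used. Neither answer covers that, so here is a possible sketch, not a tested solution for the asker's project. It selects every text node with the XPath expression //text(), replaces the placeholder in each one, and serializes the document again, which means the placeholder does not need its own tag. The class name PlaceholderReplacer and the method replacePlaceholder are hypothetical, and the XML is passed as a string to keep the example self-contained.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper class; the names are assumptions made for this sketch.
public class PlaceholderReplacer {

    // Parses the XML, replaces `search` inside every text node, and
    // returns the serialized document without an XML declaration.
    static String replacePlaceholder(String xml, String search, String replacement) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xml)));

            // Select all text nodes; the filtering is done in Java so the
            // search string never has to be quoted inside the XPath expression.
            XPath xpath = XPathFactory.newInstance().newXPath();
            NodeList textNodes =
                    (NodeList) xpath.evaluate("//text()", doc, XPathConstants.NODESET);

            for (int i = 0; i < textNodes.getLength(); i++) {
                Node text = textNodes.item(i);
                String value = text.getNodeValue();
                if (value.contains(search)) {
                    text.setNodeValue(value.replace(search, replacement));
                }
            }

            Transformer transformer = TransformerFactory.newInstance().newTransformer();
            transformer.setOutputProperty(OutputKeys.OMIT_XML_DECLARATION, "yes");
            StringWriter out = new StringWriter();
            transformer.transform(new DOMSource(doc), new StreamResult(out));
            return out.toString();
        } catch (Exception e) {
            // Wrapped so callers do not have to handle several checked exceptions.
            throw new IllegalStateException("Could not process XML", e);
        }
    }

    public static void main(String[] args) {
        String xml = "<Verbiage>#his/her_lc# percentile score "
                + "<Choice><Adult>convicted sexual offender, </Adult></Choice>"
                + " as well as #his/her_lc# T score.</Verbiage>";
        System.out.println(replacePlaceholder(xml, "#his/her_lc#", "her"));
    }
}
```

To work on template.xml directly, the same loop could run on the Document returned by dParser.parse(inputXML) in the question's code, with the result written to a StreamResult wrapping the file. Note that the serializer may escape characters such as > in the text content as &gt;, which is equivalent XML.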