I am trying to find page breaks in a .docx document using Apache POI so that I can determine the page number of each paragraph. The code I'm using is:
    for (XWPFRun run : paragraph.getRuns()) {
        List<CTBr> brList = run.getCTR().getBrList();
        if (brList != null && !brList.isEmpty()) {
            for (CTBr br : brList) {
                if (br.getType() == STBrType.PAGE) {
                    // page break detected
                }
            }
        } else {
            List<CTEmpty> lastRenderedPageBreakList = run.getCTR().getLastRenderedPageBreakList();
            if (lastRenderedPageBreakList != null) {
                for (CTEmpty lastRenderedPageBreak : lastRenderedPageBreakList) {
                    // page break detected
                }
            }
        }
    }
The code works fine for most pages, but not for all. Does anyone have an idea of what I'm still missing?
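For context, here is a minimal sketch of how the detected breaks could be turned into page numbers. It assumes the number of page breaks found in each paragraph has already been collected into a list. `PageCounter` and `pageNumbers` are hypothetical names used only for illustration, not part of Apache POI, and the sketch reports the page on which each paragraph ends.

```java
import java.util.ArrayList;
import java.util.List;

public class PageCounter {
    /**
     * Hypothetical helper: breaksPerParagraph.get(i) is the number of page
     * breaks detected inside paragraph i. Returns, for each paragraph, the
     * page on which it ends, counting pages from 1.
     */
    public static List<Integer> pageNumbers(List<Integer> breaksPerParagraph) {
        List<Integer> pages = new ArrayList<>();
        int page = 1; // the document starts on page 1
        for (int breaks : breaksPerParagraph) {
            page += breaks;  // every detected break moves on to the next page
            pages.add(page); // page reached by the end of this paragraph
        }
        return pages;
    }
}
```

Any break the detection loop misses shifts the page number of every later paragraph, which would match the symptom of the result being right for most pages but not all.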
java parsing ms-word docx apache-poi
dpalaka