Apache OpenOffice (AOO) Bugzilla – Issue 83494
Implement XHTML import filter with MathML & XForms support
Last modified: 2013-02-07 22:32:32 UTC
we can use new process of importing both HTML&XHTML files 1. if file is valid XHTML 1.0 Strict - goto 3 2. sanitize source with tidy (http://www.w3.org/People/Raggett/tidy/) 3. perform import of XHTML 1.0 Strict i guess it will have more sense in terms of "feature-proof" - it allow to drop support for old "street HTML" filter and free some C++ developer resources, move transformation logic to XSLT and have one importing logic for both XHTML&HTML sources. Also it will be good for tidy project - extended testing base. Some open office issues will be directed to tidy engineers. Some bugs will be fixed even without issue report to OpenOffice.org (just after upgrading tidy). Also tidy may be used for prepare not XHTML web-pages for extracting data using XPath and XQuery. As XSLT developer i can help in greating XHTML import filter.
Reassigned to JSI.
@sus: FRom my point of view issue 83249 is duplicate to this one.
changing components because XHTML is based on 'xmloff' and there is no XHTML implementation directly in 'word processor' code.
*** Issue 83249 has been marked as a duplicate of this issue. ***
I guess this is more an enhancement to be figured out by our requirement guys. Changed owner...
Created attachment 56480 [details] XSLT 2.0 first iteration of XHTML import sample
i post first iteration of XHTML import filter (XSLT 2.0) - it already support MathML, encoded images (same as XHTML export), support of XForms coming soon at least it better then plain text so i suggest to include this as XSLT 2.0 import filter sample to OO 3.0 so peoples can be involved for this filter testing and evolution
@xquery: it's great to see the development of an xhtml import filter. Has anything happened/changed since September 08? Is it planned to include it in the default install set of OpenOffice.org? How can I install and test your filter? Thank you Gerald
it was just possibility testing, for now this task stalled at requirments from last Tuesday i am leaved Wiley so i can spend some extra time at this task Gerald, as "first customer" and voter on this task you can help by providing some xhtml as sample and testcase
Created attachment 59809 [details] new
Hello, are there any news on the XHTML import filter? As I was trying the last updated filter by xquery (attachment from Feb1), I was impressed, because it is much better than the html import, which is built in in OpenOffice.org. When will it be included in the default install set? Furthermore, are there any news on the import of external CSS during XHTML import? Thank you Gerald
i can implement it if someone will sponsor my development (about 900$ per month i need just to survive) i guess it can consume about 3-4 months (with CSS & XForms support) so it will be 2700$ - 3600$ total