0
thanks for your help but i think i couldn't able to deliver my idea to you that basically what i want.
the fist thing i want is to convert HTML to RDF, i tried your suggested "SEMWEB .NET LIBRARY" but it's only converting RDF to N3 and i want HTML to RDF. i don't whether im missing something or if i don't know how to use, i read all the documentation for it but still i m unable to do it.
second thing is i have to generate a book. for example, suppose the book is on "Computer Architecture" and it contains the sub-topics such as INTERRUPT", "CACHE", "MEMORY" and so on. now suppose if user wants to extract the info about "CACHE" then how will i be able to do it. whether parsing will be performed? if yes, then how the text will be parsed?
i look forward to your reply.
0
In you're in foront of a given XML format then you have to alternatives to handle the sitation
you can use the DOM System.xml (xmldocument.selectNode(s) to select a single node or a node list and xmlnode.InnerText to secet the text within a given node) or
you can use the XLing(build an xlinq request to get either the node or its content ) Sytem.xml.Linq
And the second alternative is the best
0
could you please guide me about sentence pruning and sentence re-ordering to generate the book format like first should be (Content, then Introduction, then Overview and so on)
the question is that how could i select the required text in the given three books. how the searching will be performed?
such as i have to generate a book on (Lets say) "INTERRUPT" and "INTERRUPT" is a subtopic in all given three books, now how could the book will b formed at runtime?
Hope the Idea is clear! :)
0
sources can be any text file and HTML
0
Are the sources all from HTML or the sources are heterogenious?