zhaohao / google-blog-converters-appengine

Automatically exported from code.google.com/p/google-blog-converters-appengine
Apache License 2.0
0 stars 0 forks source link

lj2b does not respect threaded comments #16

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
I don't know if Atom format supports threaded comments, but this
information is lost during conversion.

E.g. I have two comments in a thread
(http://darkk.livejournal.com/47327.html?thread=161247#t161247):

<comment id="629" jitemid="184" posterid="11430783">
  <body>Однако!</body>
  <date>2009-01-05T23:17:18Z</date>
</comment>
<comment id="630" jitemid="184" posterid="53346" parentid="629">
  <body>Пожелания? Предложения? Патчи? Ж-)</body>
  <date>2009-01-05T23:22:10Z</date>
</comment>

So parentid="629" is not written to blogger.xml.

Is it possible to fix that or is it unreasonable?

Original issue reported on code.google.com by mathemonkey on 14 Jan 2009 at 9:11

GoogleCodeExporter commented 9 years ago
Just a note — there may be deleted comment in the middle of tree so they 
would be
replaced with some placeholders if threading is going to be implemented.

Original comment by mathemonkey on 15 Jan 2009 at 5:09

GoogleCodeExporter commented 9 years ago
Excellent thing to know.  I'm learning more and more about the grungy details of
LiveJournal every day.

I'll look into the threading a bit in the near future.  Please feel free to 
take a
shot at it if you've got a solution in mind.

Original comment by jlu...@gmail.com on 15 Jan 2009 at 5:16

GoogleCodeExporter commented 9 years ago
I have not found good link describing blogger's Atom extentions and original 
RFC4287
is a bit too generalized.

atom:category scheme="http://schemas.google.com/g/2005#kind"
— the link points to nowhere, so I can't read details about the scheme

atom:category scheme="http://www.blogger.com/atom/ns#"
— looks like wrong scheme

But, seems, RFC4685 is the right document to read about the threading.

Original comment by mathemonkey on 15 Jan 2009 at 5:56

GoogleCodeExporter commented 9 years ago
I'm familiar with the namespaces that Blogger uses the Atom format defined in
RFC4287.  Since Atom does not allow nested entries, Blogger uses the Atom 
Threading
Extensions (RFC4685) to link the comments with the original post.

So, in the Blogger export format, there is a way to define which comments are 
linked
to the appropriate post.  Unlike LiveJournal, there is presently no way to link 
a
comment to another comment forming a thread.  Instead, all comments will be 
linked to
the original post.

It's unfortunate that this comment threading isn't supported in Blogger as it 
does
cause a loss of information during the conversion process.

There is some information on the use of this threading extension here: 
http://code.google.com/apis/blogger/docs/2.0/reference.html#LinkCommentsToPosts

That reference tells what the "http://schemas.google.com/g/2005#kind" scheme is 
for,
and the "http://www.blogger.com/atom/ns#" scheme is actually used to define
categories/tags to place on the posts.  There is not explicit documentation for 
this,
but there are examples of adding category/tags to posts here: 
http://code.google.com/apis/blogger/docs/2.0/developers_guide_protocol.html#Crea
tingPublicEntries

Original comment by jlu...@gmail.com on 15 Jan 2009 at 6:34

GoogleCodeExporter commented 9 years ago
However, unlike the in-reply-to header, the "in-reply-to" element is required to
identify the unique identifier of only a single parent resource.  
If the entry is a response to multiple resources, additional "in-reply-to" 
elements
MAY be used. © RFC4685

I don't know it blogger import will reject XML with multiply in-reply-to 
elements
(one element for post and another one for parent comment), I'll have to test 
that...
after exams :-)

Original comment by mathemonkey on 15 Jan 2009 at 7:25

GoogleCodeExporter commented 9 years ago
I have a feeling that Blogger will either be unable to import the document with
multiple in-reply-to elements, or will only use the first to link the comment 
to the
post, thus ignoring any links to parent comments.

Good luck with exams and again thanks for all of your feedback.

Original comment by jlu...@gmail.com on 15 Jan 2009 at 7:32