codex-team / editor.js

A block-style editor with clean JSON output
https://editorjs.io
Apache License 2.0

Pasting from MS Word places random new line characters in the paragraph text #2350

Open victor-david opened 1 year ago

victor-david commented 1 year ago

Describe a bug.

When pasting from MS Word into the editor, random newline characters (\n) end up in the resulting JSON, within the paragraph text itself.

Steps to reproduce:

  1. Open the editor
  2. Copy from Word and paste.
  3. Save

    {
      "id": "1LRfwKqMBi",
      "type": "paragraph",
      "data": {
        "text": "For three days it rained and everything he carried except\nfor..."
      }
    },

These line breaks are not in the Word doc; I checked the XML inside the .docx. Also, if you type everything into the editor directly, it doesn't insert random line breaks.

Expected behavior:

No extra line breaks.

Why does this matter?

My content management system has an option to strip all line breaks before delivering the HTML to the browser, to cut down on the final size.

<html><head>...</head><body>...</body></html>

I can't activate that option because then the paragraphs with random line breaks have words mashed together.
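Until the paste behavior is fixed, one possible workaround is to normalize the saved output before it reaches the CMS. The sketch below is not part of Editor.js: `SavedBlock`, `collapseLineBreaks`, and `normalizeParagraphs` are hypothetical names, and the block shape is modelled only on the JSON shown in this report. It replaces each line break (plus any spaces or tabs around it) with a single space, so words are not mashed together when line breaks are stripped later.

```typescript
// Hypothetical shape, modelled only on the block shown in the report.
interface SavedBlock {
  id?: string;
  type: string;
  data: Record<string, unknown>;
}

// Replace each run of line breaks (and the spaces/tabs around it) with one space.
function collapseLineBreaks(text: string): string {
  return text.replace(/[ \t]*(?:(?:\r\n|\r|\n)[ \t]*)+/g, " ");
}

// Return a copy of the blocks with paragraph text normalized.
// Blocks of other types, or without a string `text`, pass through unchanged.
function normalizeParagraphs(blocks: SavedBlock[]): SavedBlock[] {
  return blocks.map((block) => {
    const text = block.data.text;
    if (block.type !== "paragraph" || typeof text !== "string") {
      return block;
    }
    return { ...block, data: { ...block.data, text: collapseLineBreaks(text) } };
  });
}

const saved: SavedBlock[] = [
  {
    id: "1LRfwKqMBi",
    type: "paragraph",
    data: { text: "For three days it rained and everything he carried except\nfor..." },
  },
];

console.log(normalizeParagraphs(saved)[0].data.text);
// prints "For three days it rained and everything he carried except for..."
```

This treats the symptom only. It assumes a line break inside paragraph text is never intentional, which may not hold for every setup.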

Editor.js version:

v2.26.5

Plugins you use with their versions:

No plugins

ssingh3856 commented 1 year ago

Is this issue still open?

bhaveshvasnani commented 1 year ago

@neSpecc If this issue is still open, I would love to contribute. I'm ramping up on this project and would really appreciate it if you could give me some pointers to start. I will keep looking on my side as well, and can discuss the fix or raise a PR.

My hunch is that I might need to identify the discrepancy here in handlePasteEvent. Is that in the right direction?
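To make that direction concrete, here is a rough sketch of the kind of normalization a paste handler could apply. It rests on an assumption this thread does not confirm: that the HTML Word puts on the clipboard is hard-wrapped, so its text contains line breaks that a browser would render as spaces. `collapseSourceLineBreaks` is a hypothetical helper, not an Editor.js function, and how it would be wired into handlePasteEvent is deliberately left out.

```typescript
// Assumption: `html` is the clipboard's text/html payload and is hard-wrapped,
// so line breaks inside text are layout noise rather than content.
function collapseSourceLineBreaks(html: string): string {
  // Split on <pre>...</pre> (kept via the capture group) so that
  // preformatted content retains its line breaks.
  const parts = html.split(/(<pre\b[\s\S]*?<\/pre>)/gi);
  return parts
    .map((part) =>
      /^<pre\b/i.test(part)
        ? part
        : part.replace(/[ \t]*(?:(?:\r\n|\r|\n)[ \t]*)+/g, " ")
    )
    .join("");
}

const pasted =
  "<p>For three days it rained and everything he carried except\nfor...</p>\n<pre>a\nb</pre>";

console.log(collapseSourceLineBreaks(pasted));
// prints "<p>For three days it rained and everything he carried except for...</p> <pre>a\nb</pre>"
// (the "\n" inside <pre> stays a real line break in the output)
```

In a browser, the input string would typically come from `event.clipboardData.getData("text/html")` in a paste listener. A string-level pass like this is crude; a real fix would more likely normalize text nodes after the pasted HTML has been parsed.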

kcherenkov commented 6 months ago

Hi there, the issue still exists in v2.29.1. It breaks text pasted from Word by adding random new line characters. Could someone please look into this?