SSilence / php-imap-client

a easy solution for simple IMAP email access in php
MIT License
269 stars 136 forks source link

Bad encoding in body and subject message #133

Closed TrueShutDown closed 7 years ago

TrueShutDown commented 7 years ago

Bad encoding in body and subject message.

Subject: [subject] => B造skawiczna metoda odchudzaj帷a. Mo瞠sz schudn望 nawet 15 kg.

Correct: [subject] => Błyskawiczna metoda odchudzająca. Możesz schudnąć nawet 15 kg.

Body: (...)<title>P豉ski brzuch, smuk豉 talia - wystacz?2 tygodnie!</title>(...)

Correct: (...)<title>Płaski brzuch, smukła talia - wystaczą 2 tygodnie!</title>(...)

Var dump:

SSilence\ImapClient\IncomingMessage Object
(
    [header] => stdClass Object
        (
            [subject] => B造skawiczna metoda odchudzaj帷a. Mo瞠sz schudn望 nawet 15 kg.
            [from] => =?utf-8?b?RHIgQmVhdGEgR2FyZGFzLiBJbnRyZWRvLiAvV1A=?= <wp@wp.pl>
            [to] => shutjunk@wp.pl
            [date] => Wed, 13 Jul 2016 02:24:03 +0200
            [message_id] => <935796927.449099021468369443508.JavaMail.root@c42b46de4dda>
            [size] => 70181
            [uid] => 4
            [msgno] => 4
            [recent] => 0
            [flagged] => 0
            [answered] => 0
            [deleted] => 0
            [seen] => 1
            [draft] => 0
            [udate] => 1468369443
            [details] => stdClass Object
                (
                    [date] => Wed, 13 Jul 2016 02:24:03 +0200
                    [Date] => Wed, 13 Jul 2016 02:24:03 +0200
                    [subject] => B造skawiczna metoda odchudzaj帷a. Mo瞠sz schudn望 nawet 15 kg.
                    [Subject] => B造skawiczna metoda odchudzaj帷a. Mo瞠sz schudn望 nawet 15 kg.
                    [message_id] => <935796927.449099021468369443508.JavaMail.root@c42b46de4dda>
                    [toaddress] => shutjunk@wp.pl
                    [to] => Array
                        (
                            [0] => stdClass Object
                                (
                                    [mailbox] => shutjunk
                                    [host] => wp.pl
                                )

                        )

                    [fromaddress] => =?utf-8?b?RHIgQmVhdGEgR2FyZGFzLiBJbnRyZWRvLiAvV1A=?= <wp@wp.pl>
                    [from] => Array
                        (
                            [0] => stdClass Object
                                (
                                    [personal] => =?utf-8?b?RHIgQmVhdGEgR2FyZGFzLiBJbnRyZWRvLiAvV1A=?=
                                    [mailbox] => wp
                                    [host] => wp.pl
                                )

                        )

                    [reply_toaddress] => mail@intredo.com
                    [reply_to] => Array
                        (
                            [0] => stdClass Object
                                (
                                    [mailbox] => mail
                                    [host] => intredo.com
                                )

                        )

                    [Recent] =>  
                    [Unseen] =>  
                    [Flagged] =>  
                    [Answered] =>  
                    [Deleted] =>  
                    [Draft] =>  
                    [Msgno] =>    4
                    [MailDate] => 13-Jul-2016 02:24:03 +0200
                    [Size] => 70181
                    [udate] => 1468369443
                )

        )

    [message] => stdClass Object
        (
            [html] => <html>
    <head>
        <meta http-equiv="content-type" content="text/html; charset=iso-8859-2" />
        <meta name="viewport" content="width=device-width, initial-scale=1.0">
        <title>P豉ski brzuch, smuk豉 talia - wystacz?2 tygodnie!</title>
    </head>
    <body>
        <a target="_blank" title="P豉ski brzuch, smuk豉 talia - wystacz?2 tygodnie!" href="http://dot.wp.pl/redir?SN=mailingwp&url=http%3A%2F%2Fwww.adxtro.com%2Fr%2Fwro2SBn9Wd-GgxOtSI5hM%2F%3Futm_creation%3Dmailing223&par=mailingid%3Dc02cf39bca82cb6f9de8dffb1a9efb6%26creationid%3D47094%26elementid%3D1%26uuid%3Dd96900d1-cc3a-4f3b-9c4e-cccdc5c67341%26scoremode%3D0%26systemid%3D1"><img src="cid:mailing223.jpg" alt="P豉ski brzuch, smuk豉 talia - wystacz?2 tygodnie!"/></a>

<br /><img src="https://dot.wp.pl/dot.gif?SN=mailingwp&Pos=mailing&pg=http%3A%2F%2Fmailing%2F%3Fmailingid%3Dc02cf39bca82cb6f9de8dffb1a9efb6%26creationid%3D47094%26uuid%3Dd96900d1-cc3a-4f3b-9c4e-cccdc5c67341%26scoremode%3D0%26systemid%3D1" width="1" height="1" border="0"  style="display:none"/>
<br /><div style="border: 0; color: #222; font-family: 'Helvetica Neue', Helvetica, Roboto, Arial, sans-serif; font-size: 12px; line-height: 16px; margin-top: 12px; margin-bottom: 12px; padding: 0; text-align: left;">
<div style="background: #dddddd; display: block; font-size: 1px; height: 1px; line-height: 1px; margin-bottom: 12px; overflow: hidden; width: 50%;"></div>
List przekazany przez Grup?Wirtualna Polska S.A. na zlecenie firmy:
<br />
<br />Intredo Ltd
<br />7A Sir Luigi Camilleri Street,
<br />Sliema SLM 1843, Malta 
<br />
<br />
<br />List zosta?wys豉ny zgodnie z regulaminami kont pocztowych Grupy Wirtualna Polska S.A.
<div style="background: #dddddd; display: block; font-size: 1px; height: 1px; line-height: 1px; margin-top: 12px; overflow: hidden; width: 50%;"></div>
</div></body>
</html>

            [info] => Array
                (
                    [0] => SSilence\ImapClient\Section Object
                        (
                            [_structure:SSilence\ImapClient\Section:private] => stdClass Object
                                (
                                    [type] => 0
                                    [encoding] => 3
                                    [ifsubtype] => 1
                                    [subtype] => HTML
                                    [ifdescription] => 0
                                    [ifid] => 0
                                    [lines] => 33
                                    [bytes] => 2538
                                    [ifdisposition] => 0
                                    [ifdparameters] => 0
                                    [ifparameters] => 1
                                    [parameters] => Array
                                        (
                                            [0] => stdClass Object
                                                (
                                                    [attribute] => charset
                                                    [value] => ISO-8859-2
                                                )

                                        )

                                )

                            [_body:SSilence\ImapClient\Section:private] => <html>
    <head>
        <meta http-equiv="content-type" content="text/html; charset=iso-8859-2" />
        <meta name="viewport" content="width=device-width, initial-scale=1.0">
        <title>P豉ski brzuch, smuk豉 talia - wystacz?2 tygodnie!</title>
    </head>
    <body>
        <a target="_blank" title="P豉ski brzuch, smuk豉 talia - wystacz?2 tygodnie!" href="http://dot.wp.pl/redir?SN=mailingwp&url=http%3A%2F%2Fwww.adxtro.com%2Fr%2Fwro2SBn9Wd-GgxOtSI5hM%2F%3Futm_creation%3Dmailing223&par=mailingid%3Dc02cf39bca82cb6f9de8dffb1a9efb6%26creationid%3D47094%26elementid%3D1%26uuid%3Dd96900d1-cc3a-4f3b-9c4e-cccdc5c67341%26scoremode%3D0%26systemid%3D1"><img src="cid:mailing223.jpg" alt="P豉ski brzuch, smuk豉 talia - wystacz?2 tygodnie!"/></a>

<br /><img src="https://dot.wp.pl/dot.gif?SN=mailingwp&Pos=mailing&pg=http%3A%2F%2Fmailing%2F%3Fmailingid%3Dc02cf39bca82cb6f9de8dffb1a9efb6%26creationid%3D47094%26uuid%3Dd96900d1-cc3a-4f3b-9c4e-cccdc5c67341%26scoremode%3D0%26systemid%3D1" width="1" height="1" border="0"  style="display:none"/>
<br /><div style="border: 0; color: #222; font-family: 'Helvetica Neue', Helvetica, Roboto, Arial, sans-serif; font-size: 12px; line-height: 16px; margin-top: 12px; margin-bottom: 12px; padding: 0; text-align: left;">
<div style="background: #dddddd; display: block; font-size: 1px; height: 1px; line-height: 1px; margin-bottom: 12px; overflow: hidden; width: 50%;"></div>
List przekazany przez Grup?Wirtualna Polska S.A. na zlecenie firmy:
<br />
<br />Intredo Ltd
<br />7A Sir Luigi Camilleri Street,
<br />Sliema SLM 1843, Malta 
<br />
<br />
<br />List zosta?wys豉ny zgodnie z regulaminami kont pocztowych Grupy Wirtualna Polska S.A.
<div style="background: #dddddd; display: block; font-size: 1px; height: 1px; line-height: 1px; margin-top: 12px; overflow: hidden; width: 50%;"></div>
</div></body>
</html>

                        )

                )

            [text] => 
        )

    [attachments] => Array
        (
                (...)
        )

    [section] => Array
        (
            [0] => 1
            [1] => 2
        )

    [structure] => stdClass Object
        (
            [type] => 1
            [encoding] => 0
            [ifsubtype] => 1
            [subtype] => RELATED
            [ifdescription] => 0
            [ifid] => 0
            [ifdisposition] => 0
            [ifdparameters] => 0
            [ifparameters] => 1
            [parameters] => Array
                (
                    [0] => stdClass Object
                        (
                            [attribute] => boundary
                            [value] => ----=_Part_45030822_955857257.1468369443508
                        )

                )

            [parts] => Array
                (
                    [0] => stdClass Object
                        (
                            [type] => 0
                            [encoding] => 3
                            [ifsubtype] => 1
                            [subtype] => HTML
                            [ifdescription] => 0
                            [ifid] => 0
                            [lines] => 33
                            [bytes] => 2538
                            [ifdisposition] => 0
                            [ifdparameters] => 0
                            [ifparameters] => 1
                            [parameters] => Array
                                (
                                    [0] => stdClass Object
                                        (
                                            [attribute] => charset
                                            [value] => ISO-8859-2
                                        )

                                )

                        )

                    [1] => stdClass Object
                        (
                            [type] => 5
                            [encoding] => 3
                            [ifsubtype] => 1
                            [subtype] => JPEG
                            [ifdescription] => 0
                            [ifid] => 1
                            [id] => <mailing223.jpg>
                            [bytes] => 66764
                            [ifdisposition] => 1
                            [disposition] => inline
                            [ifdparameters] => 1
                            [dparameters] => Array
                                (
                                    [0] => stdClass Object
                                        (
                                            [attribute] => filename
                                            [value] => mailing223.jpg
                                        )

                                )

                            [ifparameters] => 1
                            [parameters] => Array
                                (
                                    [0] => stdClass Object
                                        (
                                            [attribute] => name
                                            [value] => mailing223.jpg
                                        )

                                )

                        )

                )

        )

    [debug] => 
    [imapStream:SSilence\ImapClient\IncomingMessage:private] => Resource id #5
    [id:SSilence\ImapClient\IncomingMessage:private] => 4
    [uid:SSilence\ImapClient\IncomingMessage:private] => 
    [countAttachment:SSilence\ImapClient\IncomingMessage:private] => 1
)
sergey144010 commented 7 years ago

Can you send this email to me? sergey144010 gmail.com

TrueShutDown commented 7 years ago

I send e-mail to you from shutjunk (at) wp.pl. Check SPAM folder, because this message is ad.

sergey144010 commented 7 years ago

I will sort this out.

MrSinor commented 7 years ago

Also this problem is in getBriefInfoMessages();

sergey144010 commented 7 years ago

Try this and show me

    $imap->getMessage(1);
    $str = $imap->incomingMessage->header->subject;
    $str = iconv('utf-8', 'iso-8859-2', $str);
    var_dump($str);
sergey144010 commented 7 years ago

Where do you put var_dump() ? You have Notepad++ program ? If you write in the script like this:

    $imap->getMessage(1);
    echo $imap->incomingMessage->header->subject;

And write the output to a file, like this:

    php test.php > test.txt

And then open it in Notepad++. You will see the correct string. Same thing with this:

    $imap->getMessage(1);
    echo $imap->incomingMessage->header->body;

And choose the ISO 8859-2 encoding.

If you display var_dump() or echo() in a browser or console, then they need to specify iso-8859-2 encoding.

Check it out and write here.

TrueShutDown commented 7 years ago

Ok, I write in script iconv('iso-8859-2', 'utf-8', $email->header->subject) and this working.

maybe good idea is add option who allow recognize and auto convert body and subject to utf-8?

Thanks

sergey144010 commented 7 years ago

The updated version of the code contains the encoding field here $imap->incomingMessage->message->html->charset. It's all for now. Then we'll see.

sergey144010 commented 7 years ago

You can also completely disable decoding and see how the original string looks. getMessage($id, $decode = IncomingMessage::NOT_DECODE)

TrueShutDown commented 7 years ago

Ok, Thank you for support :)