ietf-tools / datatracker

The day-to-day front-end to the IETF database for people who work on IETF standards.
https://datatracker.ietf.org
BSD 3-Clause "New" or "Revised" License

Problems with draft submission email parsing #1835

Closed ietf-svn-bot closed 1 year ago

ietf-svn-bot commented 8 years ago

type_defect | by rcross@amsl.com


A comma in the email address string causes parsing problems:

In 6d4a31ba35b707e1679740f4da71dd1cec12e4f9: Submission.objects.get(name='draft-birrane-dtn-sbsp',rev='01').authors
Out 6d4a31ba35b707e1679740f4da71dd1cec12e4f9: u'Edward J. Birrane, III Edward.Birrane@jhuapl.edu\nJeremy Pierce-Mayer jeremy.mayer@insyen.com\nDennis C. Iannicca dennis.c.iannicca@nasa.gov'

Edward: (550, '5.1.1 : Recipient address rejected: User unknown in local recipient table')

The original message follows:

Subject: New Version Notification for draft-birrane-dtn-sbsp-01.txt
Date: Fri, 16 Oct 2015 14:11:01 -0700
To: , "III" Edward.Birrane@jhuapl.edu, "Dennis C. Iannicca" dennis.c.iannicca@nasa.gov, "Edward J. Birrane" edward.birrane@jhuapl.edu, "Dennis C. Iannicca" dennis.c.iannicca@nasa.gov, "Jeremy Pierce-Mayer" jeremy.mayer@insyen.com, "Jeremy Pierce-Mayer" jeremy.mayer@insyen.com


Here is an example of a submission that results in an empty address (the last author, Steven Lin, has no email address):

In dc22fb842160a0c0387457a61838e807e01c8c4a: Submission.objects.get(name='draft-sreekantiah-idr-segment-routing-te').authors
Out dc22fb842160a0c0387457a61838e807e01c8c4a: u'Arjun Sreekantiah asreekan@cisco.com\nClarence FilsFils cfilsfil@cisco.com\nStefano Previdi sprevidi@cisco.com\nSiva Sivabalan msiva@cisco.com\nPaul Mattes pamattes@microsoft.com\nSteven Lin '


Issue migrated from trac:1835 at 2022-03-04 04:48:28 +0000

ietf-svn-bot commented 8 years ago

@rjsparks@nostrum.com commented


This probably involves work on both the parser and the code that builds the author value, so that display names are put in quotes when necessary. (I see we have a similar problem with the formatted address of Email objects.)
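As a rough illustration of the quoting idea (not the datatracker's actual code), Python's standard `email.utils` module covers both halves: `formataddr` quotes a display name that contains specials such as a comma, and `getaddresses` parses the result back. The helper names `format_author` and `parse_authors` are hypothetical, and the sketch assumes Python 3.

```python
from email.utils import formataddr, getaddresses


def format_author(name, email):
    """Return 'name <email>', quoting the display name if it contains specials."""
    return formataddr((name, email))


def parse_authors(header_value):
    """Parse a comma-separated address list into (name, email) pairs."""
    return getaddresses([header_value])


# Author data taken from the draft-birrane-dtn-sbsp-01 example in this ticket.
authors = [
    ("Edward J. Birrane, III", "Edward.Birrane@jhuapl.edu"),
    ("Jeremy Pierce-Mayer", "jeremy.mayer@insyen.com"),
]

# With quoting, the comma in the display name survives a round trip.
to_header = ", ".join(format_author(name, email) for name, email in authors)
print(to_header)
print(parse_authors(to_header))

# Without quoting, the comma is read as an address separator, so parsing
# does not yield the single intended (name, email) pair.
unquoted = "Edward J. Birrane, III <Edward.Birrane@jhuapl.edu>"
print(parse_authors(unquoted))
```

Any real fix would need to apply the same quoting wherever author strings are turned into mail headers, including the Email objects mentioned above.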

There's also some data cleanup to be done. Here are strings from Submission.authors that contain commas:

{u'Bhumip Khasnabish <vumip1@gmail.com, bhumip.khasnabish@ztetx.com>',
 u'Brasher, D L., "DIASER manual", July 2010, http://',
 u'Caitlin Bestler <caitlin.bestler@nexenta.com,cait@asomi.com>',
 u'Cryptonector, LLC <nico@cryptonector.com>',
 u'Deng, X., Zhou, C., Boucadair, M., Bajko, G.,',
 u'Dimitri Papadimitriou <dimitri.papadimitriou@alcatel-lucent,be>',
 u'Edward J. Birrane, III <Edward.Birrane@jhuapl.edu>',
 u'Google, Inc <mkwst@google.com>',
 u'Grayson, Frank Brockners, Woj Dec, Gaetan Feige',
 u'Hitachi, Ltd., Yokohama Research Laboratory <kazuya.monden.vw@hitachi.com>',
 u'Huawei Technologies, <linda.dunbar@huawei.com>',
 u'Lodderstedt, T.',
 u'Philippe Niger <philippe,niger@orange-ftgroup.com>',
 u'Poehls, Henrich <hp@sec.uni-passau.de>',
 u'Ramalho, L. Netsch, Y. Stachurski, Miao Lei, H. Taddei,',
 u'Ramalho, L. Netsch, Y. Stachurski, Miao Lei, H. Taddei, Q.',
 u'Raymond Key <raymond.key@team.telstra,com>',
 u'Room 225, Main Building, Tsinghua University <congxiao@cernet.edu.cn>',
 u'Samsung Telecommunications America, <narendrasingh.bisht@gmail.com>',
 u'The RSOC will operate under the authority of the IAB, with the IAB <serve@the>',
 u'Tom Kristensen <tomkrist@cisco.com, tomkri@ifi.uio.no>',
 u'Tsou, T., Li, W.,',
 u'Tsou, T., Zhou, C., Sun, Q., Boucadair, M.,',
 u'Young Lee <ylee@huawei,com>',
 u'Zhen Cao <zhen.cao@gmail.com, caozhen@chinamobile.com>'}
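The strings above seem to fall into a few rough groups, which could be triaged before cleanup. The sketch below is one possible way to bucket them, assuming Python 3; the function `classify` and its bucket names are made up for illustration and are not part of the datatracker.

```python
import re

# Matches the first <...> part of an author string, if there is one.
ADDR_RE = re.compile(r"<([^<>]*)>")


def classify(author):
    """Roughly bucket an author string that contains a comma."""
    match = ADDR_RE.search(author)
    if match is None:
        return "no-address"        # no <...> part at all, e.g. a stray reference line
    if "," in match.group(1):
        return "comma-in-address"  # typo for '.' or two addresses in one field
    return "comma-in-name"         # display name that needs quoting


samples = [
    "Edward J. Birrane, III <Edward.Birrane@jhuapl.edu>",
    "Young Lee <ylee@huawei,com>",
    "Tom Kristensen <tomkrist@cisco.com, tomkri@ifi.uio.no>",
    "Lodderstedt, T.",
]

for sample in samples:
    print(classify(sample), "|", sample)
```

Only the "comma-in-name" group would be fixed by quoting alone; the other two groups likely need manual review.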

ietf-svn-bot commented 5 years ago

@rjsparks@nostrum.com changed priority from major to medium

ietf-svn-bot commented 4 years ago

@rjsparks@nostrum.com changed status from new to accepted

rjsparks commented 1 year ago

Closing as stale.