Closed Nirus2000 closed 3 months ago
I am skeptical that this change will help us in the long run.
What problem exactly is this pull request trying to address?
My hunch is:
Where am I wrong?
Where I see the point is to move the "Mrz" and "Mär" handling to the ExtractorUtil.asDate
method. Although I understand at the moment we know only that the Sutor bank that is abbreviating "März" as "Mrz". This change I will cherry-pick regardless of the other discussion.
And: please, please, before make the change that takes a lot of work (I know looking up all month name in all languages takes a long of work and diligence), we can also make a draft change and then discuss :-)
Hello @buchen I understand... the first goal has already been successful. The issue with the JDK 7 to 8... okay, your variant is smarter ;-)
The problem is that with this pattern, we get a fail faster than if we work .*
or [\w]{3,4}
or [\wä]{3,4}
and then java checks if this is a date. I therefore do not believe that this is slower.
We pattern first and then we check if it's a date, right?
What is the best pattern for the month.. an universal pattern. 👍🏻
This only applies to the names of the months, there are no other date differences in my opinion. Like... one or two digit numbers... the month name ...
Alex 🔢
Improvement of the regular expression in the date by replacing the regex pattern in the format MMMM (LLLL) and MMM (LLL). This is defined in the ExtractorUtils as a static variable.
Remove .replace("Mar", "Mar") (JDK7 vs. JDK8)
https://github.com/portfolio-performance/portfolio/pull/4029#discussion_r1612042728 and ff.
@buchen -> https://github.com/portfolio-performance/portfolio/issues/2683