MetricsGrimoire / CVSAnalY

The CVSAnalY tool extracts information out of source code repository logs and stores it into a database.
http://metricsgrimoire.github.com/CVSAnalY/
GNU General Public License v2.0

Invalid UTF8 character #99

Open canasdiaz opened 8 years ago

canasdiaz commented 8 years ago
Parsing log for /home/joomla/dashboards/joomla/scm/utilities (git)
/usr/local/lib/python2.7/dist-packages/cvsanaly2-2.1.0-py2.7.egg/pycvsanaly2/DBTempLog.py:133: Warning: Invalid utf8 character string: '800228'
  commits)
/usr/local/lib/python2.7/dist-packages/cvsanaly2-2.1.0-py2.7.egg/pycvsanaly2/DBTempLog.py:143: Warning: Invalid utf8 character string: '800228'
  statement("INSERT into _temp_log (rev, date, object) values (?, ?, ?)", self.db.place_holder), commits)
Executing extensions
Executing extension FileTypes
Executing extension CommitsLOC
[2015-11-27 04:25:57] /usr/local/bin/cvsanaly2 -u root -p '' -d jalonso_cvsanaly_joomla_6925 --extensions=CommitsLOC,FileTypes >> /home/joomla/dashboards/joomla//log/launch_cvsanaly.log 2>&1

sduenas commented 8 years ago

To reproduce the error, I need to know which repository you are analyzing.

bvasiles commented 7 years ago

The "Invalid utf8 character string: '800228'" warning happens when analyzing CVSAnalY's own repository.
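
One possible reading of the '800228' in the warning, offered as an inference rather than something confirmed in this thread: if those digits are the offending bytes in hex, they would be 0x80 0x02 0x28, and 0x80 0x02 is how a protocol-2 pickle begins. The sketch below only shows that such a byte string is not valid UTF-8. That the "object" column holds pickled data is an assumption, and the payload and the helper name `is_valid_utf8` are made up for illustration.

```python
import pickle

# Hypothetical payload; what CVSAnalY really stores in "object" is an assumption.
payload = pickle.dumps({"rev": "abc123"}, protocol=2)

# Protocol-2 pickles start with the PROTO opcode (0x80) and the version byte (0x02).
header = payload[:2]


def is_valid_utf8(data):
    """Return True if the byte string decodes cleanly as UTF-8."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


print(header == b"\x80\x02")
print(is_valid_utf8(payload))
```

If this reading is right, the warning would come from binary data being checked against a utf8 connection character set, not from any commit message in the repository.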

jorgechollet commented 6 years ago

In DBTempLog.py, both INSERT statements on table _temp_log may need the keyword "_binary" before the third placeholder inside "values".

For example, the current statement is: "INSERT into _temp_log (rev, date, object) values (?, ?, ?)"

It may need to be changed to: "INSERT into _temp_log (rev, date, object) values (?, ?, _binary ?)"

I made the change in both INSERT statements and I no longer see the error message. I'm not sure whether this is the best way to solve it.

I got this from here: https://bugs.mysql.com/bug.php?id=79317

Cheers.
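
For anyone wanting to try the change described above, here is a minimal sketch of the statement rewrite. In MySQL, "_binary" is a character set introducer that marks the value after it as a binary string. The helpers `add_binary_introducer` and `to_driver_placeholders` are hypothetical and not part of CVSAnalY. The "%s" placeholder is an assumption about the database driver in use.

```python
# Statement as quoted in this thread, and the variant with the _binary introducer.
CURRENT_SQL = "INSERT into _temp_log (rev, date, object) values (?, ?, ?)"
PATCHED_SQL = "INSERT into _temp_log (rev, date, object) values (?, ?, _binary ?)"


def add_binary_introducer(sql):
    """Hypothetical helper: prefix the last '?' placeholder with _binary."""
    head, sep, tail = sql.rpartition("?")
    if not sep:
        return sql  # no placeholder found, leave the statement untouched
    if head.rstrip().endswith("_binary"):
        return sql  # already patched, keep the rewrite idempotent
    return head + "_binary ?" + tail


def to_driver_placeholders(sql, place_holder="%s"):
    """Hypothetical stand-in: swap each generic '?' for the driver's placeholder."""
    return sql.replace("?", place_holder)


print(add_binary_introducer(CURRENT_SQL))
print(to_driver_placeholders(PATCHED_SQL))
```

Only the third value (the "object" column) is marked as binary; "rev" and "date" are left as they were.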