maxharlow / csvmatch

🔎 Finds fuzzy matches between CSV files

CSVmatch great but install problem in Ubuntu Linux #16

Closed sfd99 closed 6 years ago

sfd99 commented 6 years ago

Hi Max, I think that csvmatch is clever and great!

But when I try to install it with: pip install csvmatch
on my PC running Ubuntu Linux 14.04 LTS 32-bit (it includes Python 2.7.6 [GCC 4.8.4] on linux2), I get the errors below.

I'm not familiar with Python (I'm an R person) and don't understand what these error messages mean.

What can I do to complete the csvmatch install on my PC?

I'd love to try it out. Help, Max!

SFd99, San Francisco

INSTALL MESSAGES:

~$ pip install csvmatch
Downloading/unpacking csvmatch
  Downloading csvmatch-1.13-py2.py3-none-any.whl
Downloading/unpacking doublemetaphone==0.1 (from csvmatch)
  Downloading DoubleMetaphone-0.1.tar.gz
  Running setup.py (path:/tmp/pip_build_ray/doublemetaphone/setup.py) egg_info for package doublemetaphone

Downloading/unpacking colorama==0.3.9 (from csvmatch)
  Downloading colorama-0.3.9-py2.py3-none-any.whl
Downloading/unpacking chardet==3.0.4 (from csvmatch)
  Downloading chardet-3.0.4-py2.py3-none-any.whl (133kB): 133kB downloaded
Downloading/unpacking jellyfish==0.5.6 (from csvmatch)
  Downloading jellyfish-0.5.6.tar.gz (132kB): 132kB downloaded
  Running setup.py (path:/tmp/pip_build_ray/jellyfish/setup.py) egg_info for package jellyfish

    warning: no previously-included files matching '.git' found anywhere in distribution
Requirement already satisfied (use --upgrade to upgrade): unidecode==0.4.21 in /usr/local/lib/python2.7/dist-packages (from csvmatch)
Downloading/unpacking dedupe==1.8.1 (from csvmatch)
  Downloading dedupe-1.8.1.tar.gz (54kB): 54kB downloaded
  Running setup.py (path:/tmp/pip_build_ray/dedupe/setup.py) egg_info for package dedupe

Downloading/unpacking fastcluster (from dedupe==1.8.1->csvmatch)
  Downloading fastcluster-1.1.24.tar.gz (166kB): 166kB downloaded
  Running setup.py (path:/tmp/pip_build_ray/fastcluster/setup.py) egg_info for package fastcluster
    Version: 1.1.24

Downloading/unpacking dedupe-hcluster (from dedupe==1.8.1->csvmatch)
  Downloading dedupe-hcluster-0.3.2.tar.gz (166kB): 166kB downloaded
  Running setup.py (path:/tmp/pip_build_ray/dedupe-hcluster/setup.py) egg_info for package dedupe-hcluster

Downloading/unpacking affinegap>=1.3 (from dedupe==1.8.1->csvmatch)
  Downloading affinegap-1.10.tar.gz
  Running setup.py (path:/tmp/pip_build_ray/affinegap/setup.py) egg_info for package affinegap

Downloading/unpacking categorical-distance>=1.9 (from dedupe==1.8.1->csvmatch)
  Downloading categorical_distance-1.9-py2-none-any.whl
Downloading/unpacking dedupe-variable-datetime (from dedupe==1.8.1->csvmatch)
  Downloading dedupe_variable_datetime-0.1.5-py2-none-any.whl
Requirement already satisfied (use --upgrade to upgrade): future>=0.14 in /usr/local/lib/python2.7/dist-packages (from dedupe==1.8.1->csvmatch)
Downloading/unpacking rlr>=2.4.3 (from dedupe==1.8.1->csvmatch)
  Downloading rlr-2.4.3-py2.py3-none-any.whl
Downloading/unpacking numpy>=1.12 (from dedupe==1.8.1->csvmatch)
  Downloading numpy-1.14.1.zip (4.9MB): 4.9MB downloaded
  Running setup.py (path:/tmp/pip_build_ray/numpy/setup.py) egg_info for package numpy
    Running from numpy source directory.
    /usr/lib/python2.7/distutils/dist.py:267: UserWarning: Unknown distribution option: 'python_requires'
      warnings.warn(msg)

    warning: no previously-included files matching '*.pyc' found anywhere in distribution
    warning: no previously-included files matching '*.pyo' found anywhere in distribution
    warning: no previously-included files matching '*.pyd' found anywhere in distribution
    warning: no previously-included files matching '*.swp' found anywhere in distribution
    warning: no previously-included files matching '*.bak' found anywhere in distribution
    warning: no previously-included files matching '*~' found anywhere in distribution
Downloading/unpacking highered>=0.2.0 (from dedupe==1.8.1->csvmatch)
  Downloading highered-0.2.1-py2.py3-none-any.whl
Downloading/unpacking simplecosine>=1.2 (from dedupe==1.8.1->csvmatch)
  Downloading simplecosine-1.2-py2.py3-none-any.whl
Downloading/unpacking haversine>=0.4.1 (from dedupe==1.8.1->csvmatch)
  Downloading haversine-0.4.5.tar.gz
  Running setup.py (path:/tmp/pip_build_ray/haversine/setup.py) egg_info for package haversine

Downloading/unpacking BTrees>=4.1.4 (from dedupe==1.8.1->csvmatch)
  Downloading BTrees-4.4.1.tar.gz (166kB): 166kB downloaded
  Running setup.py (path:/tmp/pip_build_ray/BTrees/setup.py) egg_info for package BTrees
    warning: no previously-included files matching '*.dll' found anywhere in distribution
    warning: no previously-included files matching '*.pyc' found anywhere in distribution
    warning: no previously-included files matching '*.pyo' found anywhere in distribution
    warning: no previously-included files matching '*.so' found anywhere in distribution
    warning: no previously-included files matching 'coverage.xml' found anywhere in distribution
    no previously-included directories found matching 'docs/_build'
    no previously-included directories found matching 'persistent/__pycache__'
    In file included from persistent/cPersistence.h:18:0,
                     from persistent/cPersistence.c:19:
    persistent/_compat.h:18:20: fatal error: Python.h: No such file or directory
     #include "Python.h"
                        ^
    compilation terminated.
    Traceback (most recent call last):
      File "<string>", line 17, in <module>
      File "/tmp/pip_build_ray/BTrees/setup.py", line 158, in <module>
        """
      File "/usr/lib/python2.7/distutils/core.py", line 111, in setup
        _setup_distribution = dist = klass(attrs)
      File "/usr/lib/python2.7/dist-packages/setuptools/dist.py", line 239, in __init__
        self.fetch_build_eggs(attrs.pop('setup_requires'))
      File "/usr/lib/python2.7/dist-packages/setuptools/dist.py", line 264, in fetch_build_eggs
        replace_conflicting=True
      File "/usr/lib/python2.7/dist-packages/pkg_resources.py", line 620, in resolve
        dist = best[req.key] = env.best_match(req, ws, installer)
      File "/usr/lib/python2.7/dist-packages/pkg_resources.py", line 858, in best_match
        return self.obtain(req, installer) # try and download/install
      File "/usr/lib/python2.7/dist-packages/pkg_resources.py", line 870, in obtain
        return installer(requirement)
      File "/usr/lib/python2.7/dist-packages/setuptools/dist.py", line 314, in fetch_build_egg
        return cmd.easy_install(req)
      File "/usr/lib/python2.7/dist-packages/setuptools/command/easy_install.py", line 616, in easy_install
        return self.install_item(spec, dist.location, tmpdir, deps)
      File "/usr/lib/python2.7/dist-packages/setuptools/command/easy_install.py", line 646, in install_item
        dists = self.install_eggs(spec, download, tmpdir)
      File "/usr/lib/python2.7/dist-packages/setuptools/command/easy_install.py", line 834, in install_eggs
        return self.build_and_install(setup_script, setup_base)
      File "/usr/lib/python2.7/dist-packages/setuptools/command/easy_install.py", line 1040, in build_and_install
        self.run_setup(setup_script, setup_base, args)
      File "/usr/lib/python2.7/dist-packages/setuptools/command/easy_install.py", line 1028, in run_setup
        raise DistutilsError("Setup script exited with %s" % (v.args[0],))
    distutils.errors.DistutilsError: Setup script exited with error: command 'i686-linux-gnu-gcc' failed with exit status 1
    Complete output from command python setup.py egg_info:
    warning: no previously-included files matching '*.dll' found anywhere in distribution
    warning: no previously-included files matching '*.pyc' found anywhere in distribution
    warning: no previously-included files matching '*.pyo' found anywhere in distribution
    warning: no previously-included files matching '*.so' found anywhere in distribution
    warning: no previously-included files matching 'coverage.xml' found anywhere in distribution
    no previously-included directories found matching 'docs/_build'
    no previously-included directories found matching 'persistent/__pycache__'
    In file included from persistent/cPersistence.h:18:0,
                     from persistent/cPersistence.c:19:
    persistent/_compat.h:18:20: fatal error: Python.h: No such file or directory
     #include "Python.h"
                        ^
    compilation terminated.
    Traceback (most recent call last):
      File "<string>", line 17, in <module>
      File "/tmp/pip_build_ray/BTrees/setup.py", line 158, in <module>
        """
      File "/usr/lib/python2.7/distutils/core.py", line 111, in setup
        _setup_distribution = dist = klass(attrs)
      File "/usr/lib/python2.7/dist-packages/setuptools/dist.py", line 239, in __init__
        self.fetch_build_eggs(attrs.pop('setup_requires'))
      File "/usr/lib/python2.7/dist-packages/setuptools/dist.py", line 264, in fetch_build_eggs
        replace_conflicting=True
      File "/usr/lib/python2.7/dist-packages/pkg_resources.py", line 620, in resolve
        dist = best[req.key] = env.best_match(req, ws, installer)
      File "/usr/lib/python2.7/dist-packages/pkg_resources.py", line 858, in best_match
        return self.obtain(req, installer) # try and download/install
      File "/usr/lib/python2.7/dist-packages/pkg_resources.py", line 870, in obtain
        return installer(requirement)
      File "/usr/lib/python2.7/dist-packages/setuptools/dist.py", line 314, in fetch_build_egg
        return cmd.easy_install(req)
      File "/usr/lib/python2.7/dist-packages/setuptools/command/easy_install.py", line 616, in easy_install
        return self.install_item(spec, dist.location, tmpdir, deps)
      File "/usr/lib/python2.7/dist-packages/setuptools/command/easy_install.py", line 646, in install_item
        dists = self.install_eggs(spec, download, tmpdir)
      File "/usr/lib/python2.7/dist-packages/setuptools/command/easy_install.py", line 834, in install_eggs
        return self.build_and_install(setup_script, setup_base)
      File "/usr/lib/python2.7/dist-packages/setuptools/command/easy_install.py", line 1040, in build_and_install
        self.run_setup(setup_script, setup_base, args)
      File "/usr/lib/python2.7/dist-packages/setuptools/command/easy_install.py", line 1028, in run_setup
        raise DistutilsError("Setup script exited with %s" % (v.args[0],))
    distutils.errors.DistutilsError: Setup script exited with error: command 'i686-linux-gnu-gcc' failed with exit status 1

----------------------------------------
Cleaning up...
Command python setup.py egg_info failed with error code 1 in /tmp/pip_build_ray/BTrees
Storing debug log for failure in /home/ray/.pip/pip.log

maxharlow commented 6 years ago

I think this might be because Python needs some extra stuff installed to compile the libraries being used by CSV Match. Could you try running:

sudo apt-get install python-dev

Then try installing csvmatch again. Let me know if it works.
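
For anyone hitting the same "Python.h: No such file or directory" error, one way to confirm whether the headers are present is to ask Python where it expects them. This is only a minimal sketch and not part of csvmatch: the function name find_python_header is made up for illustration, and it assumes the standard-library sysconfig module reports the same include directory that the compiler is pointed at.

```python
import os
import sysconfig


def find_python_header():
    """Return the expected path of Python.h and whether the file exists."""
    # Directory where this interpreter expects its C headers to live.
    include_dir = sysconfig.get_paths()["include"]
    header = os.path.join(include_dir, "Python.h")
    return header, os.path.isfile(header)


if __name__ == "__main__":
    header, present = find_python_header()
    if present:
        print("Found " + header)
    else:
        # On Ubuntu the python-dev package is what normally provides this file.
        print("Missing " + header)
```

If the script reports the file as missing, compiled dependencies such as BTrees will most likely keep failing until the development package is installed.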

sfd99 commented 6 years ago

Hi Max! Thanks for replying. My PC runs Ubuntu Linux 14.04 LTS 32-bit (it includes Python 2.7.6 [GCC 4.8.4]).

As suggested, I ran: sudo apt-get install python-dev. That seemed to work fine.

Then I repeated the pip install step: pip install csvmatch

I got this response:

$ sudo apt-get install python-dev

[sudo] password: 
Reading package lists... Done
Building dependency tree       
Reading state information... Done
The following extra packages will be installed:
  libpython-dev libpython2.7-dev python2.7-dev
The following NEW packages will be installed:
  libpython-dev libpython2.7-dev python-dev python2.7-dev
0 upgraded, 4 newly installed, 0 to remove and 0 not upgraded.
Need to get 22.1 MB of archives.
After this operation, 31.1 MB of additional disk space will be used.
Do you want to continue? [Y/n] y
Get:1 http://us.archive.ubuntu.com/ubuntu/ trusty-updates/main libpython2.7-dev i386 2.7.6-8ubuntu0.4 [21.8 MB]
Get:2 http://us.archive.ubuntu.com/ubuntu/ trusty/main libpython-dev i386 2.7.5-5ubuntu3 [7,090 B]
Get:3 http://us.archive.ubuntu.com/ubuntu/ trusty-updates/main python2.7-dev i386 2.7.6-8ubuntu0.4 [269 kB]
Get:4 http://us.archive.ubuntu.com/ubuntu/ trusty/main python-dev i386 2.7.5-5ubuntu3 [1,176 B]
Fetched 22.1 MB in 47s (462 kB/s)
Selecting previously unselected package libpython2.7-dev:i386.
(Reading database ... 2223961 files and directories currently installed.)
Preparing to unpack .../libpython2.7-dev_2.7.6-8ubuntu0.4_i386.deb ...
Unpacking libpython2.7-dev:i386 (2.7.6-8ubuntu0.4) ...
Selecting previously unselected package libpython-dev:i386.
Preparing to unpack .../libpython-dev_2.7.5-5ubuntu3_i386.deb ...
Unpacking libpython-dev:i386 (2.7.5-5ubuntu3) ...
Selecting previously unselected package python2.7-dev.
Preparing to unpack .../python2.7-dev_2.7.6-8ubuntu0.4_i386.deb ...
Unpacking python2.7-dev (2.7.6-8ubuntu0.4) ...
Selecting previously unselected package python-dev.
Preparing to unpack .../python-dev_2.7.5-5ubuntu3_i386.deb ...
Unpacking python-dev (2.7.5-5ubuntu3) ...
Processing triggers for man-db (2.6.7.1-1ubuntu1) ...
Setting up libpython2.7-dev:i386 (2.7.6-8ubuntu0.4) ...
Setting up libpython-dev:i386 (2.7.5-5ubuntu3) ...
Setting up python2.7-dev (2.7.6-8ubuntu0.4) ...
Setting up python-dev (2.7.5-5ubuntu3) ...

============== Below, TRY PIP install csvmatch again ===============

~$ pip install csvmatch

Downloading/unpacking csvmatch
  Downloading csvmatch-1.13-py2.py3-none-any.whl
Downloading/unpacking doublemetaphone==0.1 (from csvmatch)
  Downloading DoubleMetaphone-0.1.tar.gz
  Running setup.py (path:/tmp/pip_build_ray/doublemetaphone/setup.py) egg_info for package doublemetaphone

Downloading/unpacking colorama==0.3.9 (from csvmatch)
  Downloading colorama-0.3.9-py2.py3-none-any.whl
Downloading/unpacking chardet==3.0.4 (from csvmatch)
  Downloading chardet-3.0.4-py2.py3-none-any.whl (133kB): 133kB downloaded
Downloading/unpacking jellyfish==0.5.6 (from csvmatch)
  Downloading jellyfish-0.5.6.tar.gz (132kB): 132kB downloaded
  Running setup.py (path:/tmp/pip_build_ray/jellyfish/setup.py) egg_info for package jellyfish

    warning: no previously-included files matching '.git' found anywhere in distribution
Requirement already satisfied (use --upgrade to upgrade): unidecode==0.4.21 in /usr/local/lib/python2.7/dist-packages (from csvmatch)
Downloading/unpacking dedupe==1.8.1 (from csvmatch)
  Downloading dedupe-1.8.1.tar.gz (54kB): 54kB downloaded
  Running setup.py (path:/tmp/pip_build_ray/dedupe/setup.py) egg_info for package dedupe

Downloading/unpacking fastcluster (from dedupe==1.8.1->csvmatch)
  Downloading fastcluster-1.1.24.tar.gz (166kB): 166kB downloaded
  Running setup.py (path:/tmp/pip_build_ray/fastcluster/setup.py) egg_info for package fastcluster
    Version: 1.1.24

Downloading/unpacking dedupe-hcluster (from dedupe==1.8.1->csvmatch)
  Downloading dedupe-hcluster-0.3.2.tar.gz (166kB): 166kB downloaded
  Running setup.py (path:/tmp/pip_build_ray/dedupe-hcluster/setup.py) egg_info for package dedupe-hcluster

Downloading/unpacking affinegap>=1.3 (from dedupe==1.8.1->csvmatch)
  Downloading affinegap-1.10.tar.gz
  Running setup.py (path:/tmp/pip_build_ray/affinegap/setup.py) egg_info for package affinegap

Downloading/unpacking categorical-distance>=1.9 (from dedupe==1.8.1->csvmatch)
  Downloading categorical_distance-1.9-py2-none-any.whl
Downloading/unpacking dedupe-variable-datetime (from dedupe==1.8.1->csvmatch)
  Downloading dedupe_variable_datetime-0.1.5-py2-none-any.whl
Requirement already satisfied (use --upgrade to upgrade): future>=0.14 in /usr/local/lib/python2.7/dist-packages (from dedupe==1.8.1->csvmatch)
Downloading/unpacking rlr>=2.4.3 (from dedupe==1.8.1->csvmatch)
  Downloading rlr-2.4.3-py2.py3-none-any.whl
Downloading/unpacking numpy>=1.12 (from dedupe==1.8.1->csvmatch)
  Downloading numpy-1.14.2.zip (4.9MB): 4.9MB downloaded
  Running setup.py (path:/tmp/pip_build_ray/numpy/setup.py) egg_info for package numpy
    Running from numpy source directory.
    /usr/lib/python2.7/distutils/dist.py:267: UserWarning: Unknown distribution option: 'python_requires'
      warnings.warn(msg)

    warning: no previously-included files matching '*.pyc' found anywhere in distribution
    warning: no previously-included files matching '*.pyo' found anywhere in distribution
    warning: no previously-included files matching '*.pyd' found anywhere in distribution
    warning: no previously-included files matching '*.swp' found anywhere in distribution
    warning: no previously-included files matching '*.bak' found anywhere in distribution
    warning: no previously-included files matching '*~' found anywhere in distribution
Downloading/unpacking highered>=0.2.0 (from dedupe==1.8.1->csvmatch)
  Downloading highered-0.2.1-py2.py3-none-any.whl
Downloading/unpacking simplecosine>=1.2 (from dedupe==1.8.1->csvmatch)
  Downloading simplecosine-1.2-py2.py3-none-any.whl
Downloading/unpacking haversine>=0.4.1 (from dedupe==1.8.1->csvmatch)
  Downloading haversine-0.4.5.tar.gz
  Running setup.py (path:/tmp/pip_build_ray/haversine/setup.py) egg_info for package haversine

Downloading/unpacking BTrees>=4.1.4 (from dedupe==1.8.1->csvmatch)
  Downloading BTrees-4.4.1.tar.gz (166kB): 166kB downloaded
  Running setup.py (path:/tmp/pip_build_ray/BTrees/setup.py) egg_info for package BTrees
    warning: no previously-included files matching '*.dll' found anywhere in distribution
    warning: no previously-included files matching '*.pyc' found anywhere in distribution
    warning: no previously-included files matching '*.pyo' found anywhere in distribution
    warning: no previously-included files matching '*.so' found anywhere in distribution
    warning: no previously-included files matching 'coverage.xml' found anywhere in distribution
    no previously-included directories found matching 'docs/_build'
    no previously-included directories found matching 'persistent/__pycache__'

    Installed /tmp/pip_build_ray/BTrees/persistent-4.2.4.2-py2.7-linux-i686.egg

    warning: no previously-included files matching '*.dll' found anywhere in distribution
    warning: no previously-included files matching '*.pyc' found anywhere in distribution
    warning: no previously-included files matching '*.pyo' found anywhere in distribution
    warning: no previously-included files matching '*.so' found anywhere in distribution
    warning: no previously-included files matching '*~' found anywhere in distribution
    warning: no previously-included files matching '*#' found anywhere in distribution
    warning: no previously-included files matching 'coverage.xml' found anywhere in distribution
    no previously-included directories found matching 'docs/_build'
Requirement already satisfied (use --upgrade to upgrade): simplejson in /usr/lib/python2.7/dist-packages (from dedupe==1.8.1->csvmatch)
Downloading/unpacking zope.index (from dedupe==1.8.1->csvmatch)
  Downloading zope.index-4.3.0.tar.gz (71kB): 71kB downloaded
  Running setup.py (path:/tmp/pip_build_ray/zope.index/setup.py) egg_info for package zope.index

    warning: no previously-included files matching '*.pyc' found anywhere in distribution
    warning: no previously-included files matching '*.so' found anywhere in distribution
    warning: no previously-included files matching '*.dll' found anywhere in distribution
Downloading/unpacking Levenshtein-search (from dedupe==1.8.1->csvmatch)
  Downloading Levenshtein_search-1.4.4.tar.gz
  Running setup.py (path:/tmp/pip_build_ray/Levenshtein-search/setup.py) egg_info for package Levenshtein-search

Downloading/unpacking datetime-distance (from dedupe-variable-datetime->dedupe==1.8.1->csvmatch)
  Downloading datetime_distance-0.1.3-py2-none-any.whl
Downloading/unpacking pylbfgs (from rlr>=2.4.3->dedupe==1.8.1->csvmatch)
  Downloading PyLBFGS-0.2.0.5.tar.gz (86kB): 86kB downloaded
  Running setup.py (path:/tmp/pip_build_ray/pylbfgs/setup.py) egg_info for package pylbfgs

Downloading/unpacking pyhacrf-datamade>=0.2.0 (from highered>=0.2.0->dedupe==1.8.1->csvmatch)
  Downloading pyhacrf-datamade-0.2.1.tar.gz (255kB): 255kB downloaded
  Running setup.py (path:/tmp/pip_build_ray/pyhacrf-datamade/setup.py) egg_info for package pyhacrf-datamade
    /usr/lib/python2.7/distutils/dist.py:267: UserWarning: Unknown distribution option: 'maintiner_email'
      warnings.warn(msg)

Downloading/unpacking persistent (from BTrees>=4.1.4->dedupe==1.8.1->csvmatch)
  Downloading persistent-4.2.4.2.tar.gz (96kB): 96kB downloaded
  Running setup.py (path:/tmp/pip_build_ray/persistent/setup.py) egg_info for package persistent

    warning: no previously-included files matching '*.dll' found anywhere in distribution
    warning: no previously-included files matching '*.pyc' found anywhere in distribution
    warning: no previously-included files matching '*.pyo' found anywhere in distribution
    warning: no previously-included files matching '*.so' found anywhere in distribution
    warning: no previously-included files matching 'coverage.xml' found anywhere in distribution
    no previously-included directories found matching 'docs/_build'
    no previously-included directories found matching 'persistent/__pycache__'
Requirement already satisfied (use --upgrade to upgrade): zope.interface in /usr/lib/python2.7/dist-packages (from BTrees>=4.1.4->dedupe==1.8.1->csvmatch)
Requirement already satisfied (use --upgrade to upgrade): setuptools in /usr/lib/python2.7/dist-packages (from zope.index->dedupe==1.8.1->csvmatch)
Requirement already satisfied (use --upgrade to upgrade): six in /usr/local/lib/python2.7/dist-packages (from zope.index->dedupe==1.8.1->csvmatch)
Downloading/unpacking python-dateutil>=2.6.0 (from datetime-distance->dedupe-variable-datetime->dedupe==1.8.1->csvmatch)
  Downloading python_dateutil-2.7.0-py2.py3-none-any.whl (207kB): 207kB downloaded
Installing collected packages: csvmatch, doublemetaphone, colorama, chardet, jellyfish, dedupe, fastcluster, dedupe-hcluster, affinegap, categorical-distance, dedupe-variable-datetime, rlr, numpy, highered, simplecosine, haversine, BTrees, zope.index, Levenshtein-search, datetime-distance, pylbfgs, pyhacrf-datamade, persistent, python-dateutil
Compiling /tmp/pip_build_ray/csvmatch/csvmatch-titles.txt.py ...
  File "/tmp/pip_build_ray/csvmatch/csvmatch-titles.txt.py", line 1
    Mr.?
       ^
SyntaxError: invalid syntax

Cleaning up...
Exception:
Traceback (most recent call last):
  File "/usr/lib/python2.7/dist-packages/pip/basecommand.py", line 122, in main
    status = self.run(options, args)
  File "/usr/lib/python2.7/dist-packages/pip/commands/install.py", line 283, in run
    requirement_set.install(install_options, global_options, root=options.root_path)
  File "/usr/lib/python2.7/dist-packages/pip/req.py", line 1436, in install
    requirement.install(install_options, global_options, *args, **kwargs)
  File "/usr/lib/python2.7/dist-packages/pip/req.py", line 672, in install
    self.move_wheel_files(self.source_dir, root=root)
  File "/usr/lib/python2.7/dist-packages/pip/req.py", line 902, in move_wheel_files
    pycompile=self.pycompile,
  File "/usr/lib/python2.7/dist-packages/pip/wheel.py", line 206, in move_wheel_files
    clobber(source, lib_dir, True)
  File "/usr/lib/python2.7/dist-packages/pip/wheel.py", line 193, in clobber
    os.makedirs(destsubdir)
  File "/usr/lib/python2.7/os.py", line 157, in makedirs
    mkdir(name, mode)
OSError: [Errno 13] Permission denied: '/usr/local/lib/python2.7/dist-packages/csvmatch-1.13.dist-info'

Storing debug log for failure in /home/ray/.pip/pip.log

==== END OF FAILED INSTALL OUTPUT ====

maxharlow commented 6 years ago

Could you try:

pip install --user csvmatch

And if that doesn't work:

sudo pip install csvmatch

sfd99 commented 6 years ago

ok Max - almost there!! :-) did:
pip install --user csvmatch

After 5 minutes of a lot of compilations, END RESULT:

Successfully installed csvmatch doublemetaphone colorama chardet jellyfish dedupe fastcluster dedupe-hcluster affinegap categorical-distance dedupe-variable-datetime rlr numpy highered simplecosine haversine BTrees zope.index Levenshtein-search datetime-distance pylbfgs pyhacrf-datamade persistent python-dateutil
Cleaning up...

AND THEN... $ csvmatch data1.csv data2.csv \
--fields1 name \
--fields2 'Person Name'

OR: ~$ csvmatch

OR EVEN: $ csvmatch data1.csv data2.csv --fields1 name --fields2 'Person Name'

RESULT is always: csvmatch: command not found

I wonder if I should first switch folders (i.e. cd <folder>) before running the csvmatch command in the Linux terminal? But which Linux folder should I "cd" to? (csvmatch seems to have installed ok...)

Max, thanks for your patience and determination to help a Linux + R-citizen! :-)

maxharlow commented 6 years ago

Ok, this is progress. What we've done there is install it for your user account only, so there are no problems with permissions. However, your computer isn't set up to look for programs installed that way. Running this should hopefully fix that:

echo "PATH=\$PATH:/usr/local/bin" >> ~/.bashrc

sfd99 commented 6 years ago

Max, did that and got:

~$ echo "PATH=\$PATH:/usr/local/bin" >> ~/.bashrc
~$ csvmatch
csvmatch: command not found

and also did:

~$ csvmatch.py
csvmatch.py: command not found

Just wanted to see if the call would at least "recognize" csvmatch somewhere in the "PATH"...

btw: looked in the Linux folder: /usr/local/bin but there is no csvmatch file there...

Maybe the csvmatch file is hiding in another folder? Mystery!
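
A note on the "command not found" result above: a `--user` install normally places commands under the per-user base directory (on Linux usually ~/.local/bin) rather than /usr/local/bin, so that is a likely place to look. The sketch below is only an illustration and not something from the thread. The helper names `user_scripts_dir` and `is_on_path` are hypothetical, and the "bin" subdirectory is an assumption that applies to POSIX layouts.

```python
import os
import site


def user_scripts_dir():
    """Directory where `pip install --user` normally puts commands on Linux."""
    # site.getuserbase() returns the per-user base (typically ~/.local on Linux).
    # Appending "bin" is an assumption that holds for POSIX layouts only.
    return os.path.join(site.getuserbase(), "bin")


def is_on_path(directory, path_value=None):
    """Return True if `directory` is one of the entries in a PATH-style string."""
    if path_value is None:
        path_value = os.environ.get("PATH", "")
    wanted = os.path.normpath(directory)
    entries = [os.path.normpath(p) for p in path_value.split(os.pathsep) if p]
    return wanted in entries


if __name__ == "__main__":
    scripts = user_scripts_dir()
    print("user scripts directory: %s" % scripts)
    print("csvmatch present there: %s" % os.path.exists(os.path.join(scripts, "csvmatch")))
    print("directory is on PATH: %s" % is_on_path(scripts))
```

If the script reports that csvmatch is present but the directory is not on PATH, the usual remedy is to add that directory to PATH in ~/.bashrc and then open a new terminal, assuming the printed directory is the one pip actually used.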

maxharlow commented 6 years ago

I see you've closed the issue -- did you manage to get it working? If not, closing and reopening your terminal might do it.

sfd99 commented 6 years ago

Hi Max, no, I didn't close the issue (I thought you had closed it...).

Anyhow, no - I could not get csvmatch to work on my Linux PC.

Again Max, thank you so much for your patience and endurance with this R-guy. :-)

We really tried... csvmatch is (obviously) an easy & useful program, but now I'm trying to find a poor man's (i.e. non-Python) substitute... (maybe an R package? or a similar Linux app?).

Very best to you, Max!

maxharlow commented 6 years ago

Oh, sorry to hear that. Happy to keep trying if you have a current error message or anything. Apologies it has not been more straightforward!