Open SyureNyanko opened 3 years ago
I've explored in vaex source code, and I'll make some notes.
Vaex seems to be using python's ast parse to achieve query-like(df['a' > 10]) operations in expresso.py (about line 634).
So, characters that can be used for column names seem to be limited to characters that can be used in Python variables when using virtual column.
For example, "ππ" seems to be a character that cannot be used in column names.
import vaex
data = {'A':[1,2,3],'B':['a','b','c']}
df = vaex.from_dict(data)
df["ππ"] = df['A']
print(df)
# AttributeError: 'Subscript' object has no attribute 'id'
And, "1" and "οΌ" are interpreted as being the same in python ast and end up being inconsistent in vaex.
(base) PS C:\Users\heya\vaex-test> python
Python 3.9.5 (default, May 18 2021, 14:42:02) [MSC v.1916 64 bit (AMD64)] :: Anaconda, Inc. on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> var1 = "hogehoge"
>>> print(var1)
hogehoge
>>> print(varοΌ)
hogehoge
>>>
Please tell me if I'm wrong. Thanks!
Hi.π
Certain characters cannot be used for virtual column names, and some multi-byte characters seem to correspond to that. for example, "οΌ", "οΌ" .and "β ", "β ‘"
Error messages (below) says
RuntimeError: Oops, requesting column I from dataset, but it does not exist
, but I used "β " not "I".If there is any information you need, please let me know. Thank you!
Software information
import vaex; vaex.__version__)
: {'vaex': '4.4.0', 'vaex-core': '4.4.0', 'vaex-viz': '0.5.0', 'vaex-hdf5': '0.9.0', 'vaex-server': '0.6.0', 'vaex-astro': '0.8.3', 'vaex-ui': '0.3.0', 'vaex-jupyter': '0.6.0', 'vaex-ml': '0.13.0', 'vaex-graphql': '0.2.0'}>>>