AutoViML / AutoViz

Automatically Visualize any dataset, any size with a single line of code. Created by Ram Seshadri. Collaborators Welcome. Permission Granted upon Request.
Apache License 2.0
1.71k stars 197 forks

encoding issue #17

Closed hakobjan closed 4 years ago

hakobjan commented 4 years ago

I am trying to use AutoViz on a large data set with a shape of (1362132, 83).

THIS READS THE DATA SET:

df = pd.read_csv("./Desktop/mgh_multi_gifts_desc_joined_tb_CSV.csv", error_bad_lines=False, engine='python', sep=",", encoding='cp1252')

THIS IS MY NEXT STEP:

encoding = "cp1252"
error_bad_lines = False
engine = 'python'
sep = ','
target = 'gift_amount'
datapath = './Desktop/'
filename = 'mgh_multi_gifts_desc_joined_tb_CSV.csv'
df = pd.read_csv(datapath+filename, sep=sep, index_col=None, error_bad_lines=error_bad_lines, engine=engine, encoding=encoding)

WHEN TRYING TO EXECUTE THIS NEXT STEP:

dft = AV.AutoViz(datapath+filename, sep=sep, depVar=target, dfte=None, header=0, verbose=0, lowess=False, chart_format='svg', max_rows_analyzed=1500, max_cols_analyzed=30)

I AM GETTING THE FOLLOWING MESSAGE:

File encoding decoder utf-8 does not work for this file
File encoding decoder iso-8859-11 does not work for this file
File encoding decoder cpl252 does not work for this file
File encoding decoder latin1 does not work for this file
None of the decoders work...
Not able to read or load file. Please check your inputs and try again...

NOT SURE WHAT TO DO NEXT. ANY HELP WOULD BE MUCH APPRECIATED.
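
A message like the one above can be narrowed down outside AutoViz by checking which of the listed encodings can actually decode the raw bytes of the file. The following is a minimal sketch using only the Python standard library; the helper name `decodable_encodings` and the candidate list are assumptions made for illustration and are not part of AutoViz or pandas. One fact worth keeping in mind: Python's latin1 codec maps every possible byte value, so it cannot fail at the decoding stage. If latin1 is also reported as not working, the failure may come from parsing (for example malformed rows) rather than from the encoding itself, although the thread does not confirm this.

```python
import codecs

# Candidate encodings to probe (an assumption for this sketch; adjust as needed).
CANDIDATES = ["utf-8", "iso-8859-11", "cp1252", "latin1"]


def decodable_encodings(path, candidates=CANDIDATES, chunk_size=1 << 20):
    """Return the candidate encodings that decode the whole file without error.

    The file is read in binary chunks and fed to an incremental decoder, so a
    large file is never held in memory at once and multi-byte characters that
    straddle a chunk boundary are handled correctly.
    """
    working = []
    for name in candidates:
        decoder = codecs.getincrementaldecoder(name)(errors="strict")
        try:
            with open(path, "rb") as fh:
                for chunk in iter(lambda: fh.read(chunk_size), b""):
                    decoder.decode(chunk)
                # Flush: raises if the file ends in the middle of a character.
                decoder.decode(b"", final=True)
        except UnicodeDecodeError:
            continue
        working.append(name)
    return working
```

If the encoding used in `pd.read_csv` appears in the returned list, decoding is probably not the problem, and the parsing options (such as the separator or the handling of bad lines) are the next thing to check.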

AutoViML commented 4 years ago

Yes this is real easy to fix. If you have already loaded the data into a data frame called “df” just send it to AutoViz directly as a DF as follows:

dft = AV.AutoViz("", sep=sep, depVar=target, dfte=df, header=0, verbose=0, lowess=False, chart_format='svg', max_rows_analyzed=1500, max_cols_analyzed=30)

Try this. It should work. Ram
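
Put together, the suggestion above amounts to reading the file with pandas (where the encoding and bad-line handling can be controlled) and then handing the resulting data frame to AutoViz with an empty filename. The sketch below is one way to arrange that, not the maintainer's own code. The helper names `load_frame`, `autoviz_frame_kwargs` and `visualize_frame` are hypothetical. The `AutoViz_Class` import follows the AutoViz README, and `on_bad_lines="skip"` is the replacement in pandas 1.3 and later for `error_bad_lines=False`, which was removed in pandas 2.0; on older pandas versions, keep `error_bad_lines=False` instead.

```python
def load_frame(path, sep=",", encoding="cp1252"):
    """Read the CSV with pandas, skipping malformed rows."""
    import pandas as pd  # imported lazily so the other helpers work without pandas

    # on_bad_lines="skip" requires pandas >= 1.3 (assumption: a recent pandas).
    return pd.read_csv(path, sep=sep, index_col=None, engine="python",
                       encoding=encoding, on_bad_lines="skip")


def autoviz_frame_kwargs(df, target, sep=","):
    """Keyword arguments for AV.AutoViz when the data is already a DataFrame."""
    return dict(
        sep=sep,
        depVar=target,
        dfte=df,            # the already-loaded DataFrame
        header=0,
        verbose=0,
        lowess=False,
        chart_format="svg",
        max_rows_analyzed=1500,
        max_cols_analyzed=30,
    )


def visualize_frame(df, target, sep=","):
    """Send a loaded DataFrame to AutoViz instead of a file path."""
    from autoviz.AutoViz_Class import AutoViz_Class  # requires AutoViz installed

    AV = AutoViz_Class()
    # The first argument is the filename; an empty string means no file is read.
    return AV.AutoViz("", **autoviz_frame_kwargs(df, target, sep))
```

With these helpers, the call from the thread would look like `visualize_frame(load_frame('./Desktop/mgh_multi_gifts_desc_joined_tb_CSV.csv'), 'gift_amount')`.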

hakobjan commented 4 years ago

Thanks for your response, Ram! What would the required filename argument be?

AutoViML commented 4 years ago

You just need to give the filename argument as an empty string "", as shown in my earlier reply. You could have cut and pasted that code into your notebook and run it. Ram
