Kee2u / Deciphering_the_Indus_Valley_Script

Deciphering the Indus script using pandas, PostgreSQL on an AWS RDS instance, Machine Learning and Statistical Analysis. Results visualized using Tableau and Javascript.
13 stars 6 forks source link

Project/Dev Progress #42

Open einthusan opened 1 year ago

einthusan commented 1 year ago

Hello,

I was wondering if there was any progress on this project? I would be interested in helping with this.

Thanks, Einthusan

jjasim commented 1 year ago

Hello,

I'd be interested too, especially in preparing a properly labelled dataset of the Sangam works to represent Old Tamil, for more robust analysis.

Thanks

einthusan commented 1 year ago

What I noticed is that Mahadevan assumes the "an" as the most frequent terminator in Indus scripts because the Sangam literature represented a patriarchal culture whereas Indus Valley was matriarchal. However, Wells was I believe more accurate in claiming it was "ay/ai" as in Cow or Mother in Old Tamil. I believe it may have been actually mother and later day Vedic schools had brought the cow worship cult which doesn't seem so prevalent in Indus culture as all animals were treated similarly (even the unicorn was probably just a dominant clan symbol which shows no continuity to later day Vedic cow worship cult). Anyhow, my point being that Sangam works do have Old Tamil words but the context is remote to Indus as culture had shifted from matriarchy to patriarchy. Closest and oldest useful works would be Ainthiram (Tamil) which is often never talked about but Tolkkappiyar is said to have been well versed in it according to the preface of Tolkkappiyam. I have the PDF and the English translation which was based on a Tamil translation was totally wrong in my opinion because the translators did not have a grounded context of the materialistic non-vedic nature of the Old Tamil world. We should start at properly translating Ainthiram to have any chance at decipherment. Would like to hear your opinion.

jjasim commented 1 year ago

Hi so sorry for the late reply, I missed the notification for this.

Your idea does seem interesting and make sense. But does the Ainthiram survive to today? I thought it was a lost work! If you have a copy of it, please do share, I would love to take a look. Do you know when the text is dated to?

einthusan commented 1 year ago

Hey,

Sorry for my late response as well. Been a little busy. Below I have attached some useful things, starting with my wife and I giving an initial try at a proper English translation of the works which never went that far but was enlightening for us. Then there is a download link that will expire in 7 days, let me know if u need me to re-upload it, but it has 2 books. The Tamil re-print in 1986 and the English translation version printed in 1997 or so. Ignore all the preface and introductory stuff, and get to the actual verses which are numbered, as I have shown in the attached sample picture. Let me know how I can help.

Edit: I totally disagree with the English translation of 1997, and I feel it is so useless that it shouldn't even be considered at all.

Cheers, Einthusan

Sample translation by us Aintiram Translation.docx

Ainthiram Books (Tamil and English w/Translations) https://we.tl/t-7IvFw99vAv

Example:

Screen Shot 2023-08-26 at 5 27 28 PM
Kee2u commented 1 year ago

Hi there,

Thank you so much for your interest. I started this project to switch careers from Mechanical Engineering to Data Engineering but once I did, I put this on the backburner. This looks promising, would you two be available for a call sometime? I wonder if we could use LLMs to help with the decipherment too.

einthusan commented 1 year ago

@Kee2u Yes, I would be available for a call, and this is something of my passion. I could afford to help fund this a bit too, to help with any associated costs to move this forwards. I am in Toronto time, just fyi. My email is just my first name @ gmail.com. I will be available starting Sept 1.

Cheers, Einthusan

aravindarc commented 1 year ago

I am also interested in this work. Since Indus script is considered largely to be Logograms, I thought of applying Tholkappiam's Soladhigaram (mainly the punarchi vidhigal) to Indus Logograms, to find if they abide by a similar grammar( and if they do, identify nouns and verbs). I wasn't aware of Ainthiram.

@Kee2u How do I get access to the Indus Script corpus, I found this webapp https://indusscript.in/, which seems to have concordance also, but it couldn't be crawled as it is rendered as a canvas. I want to do the same thing that you've done, load the indus logograms into a Vector database and attach all metadata as vectors. Any help here would be appreciated.

@einthusan I would be happy to help you in translating Ainthiram, and could you also share the Tamil version.

jjasim commented 1 year ago

@aravindarc About the https://indusscript.in/, yeah that's a great pity as I really wanted to compile a dataset of all the Indus inscriptions and the other associated information to each inscription as a dataset (if you are aware of such a dataset, I'd be happy to know). Having a comprehensive and useable dataset that is available to all is key for more quality decipherment efforts on the script IMO. I'd be happy to help if you have any ideas for compiling the https://indusscript.in/ or some other source into a nice dataset.

As for your logogram idea, you might be interested in this reddit post I came across a while ago: https://www.reddit.com/r/Dravidiology/comments/15n60zd/some_visualisations_of_acai_split_sangam_era_poet/

aravindarc commented 1 year ago

@jjasim I have written to RMRL, to see if they can provide access to the database. I am waiting for their reply. I will update this thread If I get my hands on something. We can look for acquiring other datasets after that, if RMRL refuses.

einthusan commented 11 months ago

@aravindarc @jjasim Please see download link. It expires in 7 days because of WeTransfer, which is where I uploaded it due to 25MB file size limit on Github. Below is an image showing what files can be found inside.

https://we.tl/t-bqyH2utHbx

Tamil Books Indus

I am ready to help fund this work. I am really interested in translating Ainthiram first, then using that for further decipherment. I am available to meet up remotely or in person. Do let me know what you all think.

PS: Ainthiram starts at page 30, and Tolkkapiam starts at page 23. I have added Lokayata book by Debiprasad, which I believe contains the culture for Indus Valley. I haven't seen any sources out there that make this connection, but I strongly believe the philosophy of the Indus Valley should be of Lokayatham (Carvakam).