SIS-Team-24 / 2023-SIS-Team-24

Using Natural Language Processing, Text Insights examines and provides insights to text.
1 stars 0 forks source link

Sentiment API call failure when token length exceeding 512 #193

Closed henrygoodman closed 1 year ago

henrygoodman commented 1 year ago

Input (532 words)

41029 Engineering Research Preparation Preliminary Research Major: Software Engineering Henry Goodman 13032204 03-09-2023Research Interest – Digital Rights Management relating to PDF In recent years, the digital landscape has undergone rapid evolution, resulting in an unprecedented surge in data creation and sharing. This surge highlights the critical need for robust security and controlled access mechanisms, particularly within document management, with a specific focus on Digital Rights Management (DRM) applied to PDF documents. This research project is dedicated to exploring innovative methods for securing document distribution and management through DRM technologies. The primary research questions guiding this endeavour are as follows: o Effective DRM Strategies for Selective Content Access: What are the most effective strategies for achieving selective content access and controlled data distribution within digital documents, especially PDFs? How can these strategies enhance data security and privacy in a DRM context? o Integration of Encryption, Access Control, and Data Extraction in PDFs: What challenges and opportunities arise from the seamless integration of encryption, access control, and selective data extraction mechanisms in digital document management, with a specific emphasis on PDF files? How can these elements be harnessed to optimize document security while ensuring user convenience and usability? These research questions aim to delve into the intricacies of document security and distribution, with a particular focus on how DRM technologies can effectively address these challenges. Furthermore, within the domain of Software Engineering, this research will explore the development and implementation of systems that facilitate secure PDF document distribution among various stakeholders, considering the complexities of DRM within this context. In the article titled "Digital Rights Management," published in IEEE Potentials by Subramanya and Yi in 2006, the authors provide an overview of DRM concepts and their application in consumer CDs and DVDs. The article's structure begins with an abstract, introducing the topic in a broad manner, and then progressively delves into key points. Analysing the article's objective based on McCombes' method (2022), it can be characterized as descriptive and exploratory, as it provides detailed descriptions of DRM concepts. Similarly, in their 2011 article titled "Music Downloads and the Flip Side of Digital Rights Management", Vernik et al. caution against the application of DRM in music downloads, emphasizing potential limitations on consumer rights and industry profitability. While structurally like the previous article, this report introduces a model to analyse consumer behaviour in response to DRM. Here, the article's objective can be characterized as Here, the article's objective can be characterized as Here, the article's objective can be characterized as Here, the article's objective can be characterized as 

Sentiment Result: { "status_code": 500, "detail": "Server error: You need to specify either text or text_target.", "headers": null }

image

@Rgonzales4 I think this might just be a key error (maybe when we summarise, we return an object without a text key which may be required to pass to the sentiment analyser)

henrygoodman commented 1 year ago

Running with > 512 tokens, text_to_analyze is:

{'summary': 'The assessment criteria for this assessment task is made in alignment with AQF 8 descriptors. 41029 is a part of the Capstone project. Using previous submissions to other subjects is not going to be accepted. The completion of this task is based on the presumption that the student has regular communication with the supervisor. Insufficient understanding of the project and progress by the supervisor has a significant impact on the evaluation and marking. The essay is presented and organised with some areas needing improvement. There is limited or inaccurate referencing in places. Referencing might also be absent. Not yet passed Less than 50% The proposal has not yet demonstrated the level required for a pass. The proposal is wellPresented and organised. Accurately referenced throughout with minor or occasional lapses. Project plan demonstrates engineering rationality and systems thinking. It demonstrates rudimentary project management skills including time management and mitigation of uncertainties and risks.', 'keywords': {'proposal': 15, 'project': 14, 'engineering': 13, 'research': 8, 'problem': 8, 'professional': 8, 'presented': 8, 'management': 8, 'understanding': 6, 'reader': 5}}

< 512 tokens:

Task 3: Proposal Research Project The assessment criteria for this assessment task is made in alignment with 
AQF 8 descriptors. https://www.aqf.edu.au/framework/aqflevels#toc-aqf-level-8criteria-2 Components and submission requirements: � As per the template provided on Canvas � Must be individual submission � Communication log with 
supervisor attached in appendix � Length: 2000-2500 words � One submission onlyHigh Distinction 85% ~ 100% An outstanding proposal that provides the reader with a comprehensive and insightful understanding of proposed research project responding to a complicated or complex engineering problem. The proposal resembles a highly professional project proposal for an engineering R&D or an academic research project. Definition of the engineering problem is focused and clear. Relevant research is reviewed. Up to date and appropriate methodology is presented. Project 
plan demonstrates highly proficient engineering rationality and systems thinking. The proposal demonstrates professional project management skills including time management and mitigation of uncertainties and risks. Language use is error free and of a high degree of professional accuracy. The proposal is extremely well presented and organised. Accurately referenced throughout.Distinction 75% ~ 84% A superior proposal that provides the reader with a solid understanding of the proposed research project responding to a complicated or complex engineering problem. The proposal resembles a professional project proposal for an engineering R&D or an academic research project. Definition of the engineering problem is focused and clear. Relevant research is reviewed. Up to-date and appropriate methodology is presented. Project plan demonstrates proficient engineering rationality and systems thinking. 
The proposal demonstrates professional project management skills including time management and mitigation of uncertainties and risks Language use is largely error free and a degree of professional accuracy. The proposal is well presented and organised. Accurately referenced throughout with minor or occasional lapses. Credit 65% ~ 74% The proposal provides the reader with a good understanding of the proposed project responding to a complicated or complex engineering problem. Definition of the engineering problem is clear. Relevant research is reviewed. Up to-date and appropriate methodology is presented. The proposal demonstrate project management skills including time management and mitigation of uncertainties and risks. Language use is comprehensible and mostly professional with 
some lapses in fo

(This is just the input text raw)

So, looks like running through get summary reduces the text but leaves us with an object with a result key. Fix should be just to set text_to_analyze = get_summary(input_text)['summary']