anuj-ti / report

0 stars 0 forks source link

Poor Quality Answers #5

Open anuj-ti opened 1 year ago

anuj-ti commented 1 year ago
id: b4c2048c12d1bfa35898747ee33cfb4c

Question: 'what aws technologies you know?' Answer: "I don't know."

Chunks:

[
    "1NMRnF1Of7drtPTRNDADMvgTx9lW3rq5OaI1xnKGzgAg_26",
    "1edVCFrXMV78D2wjLfUeWGKNdy_cCEkDk76H5aQ1qbuo_13",
    "1qlDAIBVpuxHgkYdxmOqwnawgvNIPNgCit6lJCRTVVCA_2"
]

Intermediate Steps:

[
    " None.",
    " Deviation: Allow full access to the AWS account",
    "
    None."
]

In the chunks, there was a paragraph containing the information:
Use the Kubernetes Control Plane for deployments.\n\n|\n|THE PROBLEM |How to deploy DevFlows flows?\n\n|OPTIONS CONSIDERED +\n(Decision in bold) a|\n[arabic]\n. {blank}\n+\n\nUse AWS services, including Lambda, SNS, etc.\n\n. {blank}\n+\n\nUse the Kubernetes Control Plane for deployments.\n\n\n|REASONING a|\nThe team initially tried to use the AWS APIs but had to deal with the\ncomplexities of API Gateway, SNS, SQS, Lambda and IAM rules. They kept\nrunning into problems, and the original DevFlows was a Proof of Concept\nand needed to be done quickly.

This information was not extracted. Loss of information occurs at the intermediate steps only

rakshit-ti commented 1 year ago

Experiment: Scoring of individual answers with default map_rerank prompt

Description: When scoring individual answers using the default map_rerank prompt, several answers received a score of 100. However, this high score does not provide significant help in combining the answers.

Steps:

  1. Use the default map_rerank prompt for scoring individual answers with the chunk mentioned in the issue description as input
  2. Observe the scores of the answers provided.

Expected Result: The scoring should provide more diverse and informative scores to assist in combining the answers effectively.

Actual Result: Multiple answers received a score of 100, which does not contribute significantly to the process of combining the answers. Following had a score of 100: { "intermediate_steps": [ { "answer": "Fargate was considered as an alternative to Devflows because it allows AWS experts to build solutions using Lambda functions, Step Functions and/or Fargate, which require coding, whereas DevFlows is a visual no-code environment.", "score": "100" }, { "answer": "Fargate was not considered as an alternative to Devflows.", "score": "100" }, { "answer": "Fargate was not considered as an alternative to Devflows.", "score": "100" }, { "answer": "Fargate was considered as an alternative to Devflows because it has the elasticity required, can scale to zero, and provides the relevant level of control for running actions as jobs.", "score": "100" }, { "answer": "To avoid the limitations of Lambdas and to provide message passing, orchestration and scale-to-zero serverless compute.", "score": "100" }, { "answer": "Fargate was considered as an alternative to Devflows because it allows an action to deliver its results by using additional infrastructure such as Fargate tasks, EventBridge and/or SNS notifications, or webhooks from external systems.", "score": "100" }, { "answer": "Fargate was considered as an alternative to Devflows because it could simplify the system by running all short-running invocables via Lambda and all long-running invocables on Fargate for EKS.", "score": "100" }, { "answer": "Fargate was not considered as an alternative to Devflows.", "score": "100" }, { "answer": "Fargate was considered as an alternative to Devflows because it allows for the execution of long-running tasks without the need for the flow developer to estimate execution time.", "score": "100" } ], "output_text": "Fargate was considered as an alternative to Devflows because it allows AWS experts to build solutions using Lambda functions, Step Functions and/or Fargate, which require coding, whereas DevFlows is a visual no-code environment." }

NOTE: Final answer in this case was acceptable, "Fargate was considered as an alternative to Devflows because it allows AWS experts to build solutions using Lambda functions, Step Functions and/or Fargate, which require coding, whereas DevFlows is a visual no-code environment." Next Steps:

rakshit-ti commented 1 year ago

Experiment with 16K window:

rakshit-ti commented 1 year ago
rakshit-ti commented 1 year ago

Conclusion

rakshit-ti commented 1 year ago

16-K window in query-pipeline