I get much longer results from chatgpt in playground

gblazex commented 1 year ago

e.g. How is the climate change affecting our planet's biodiversity?

the boxing site says chatgpt 3.5 turbo answer is:

Climate change is leading to temperature rises, altered rainfall patterns, and sea-level rise, all of which are negatively impacting ecosystems and causing shifts in species distribution and abundance. This loss, reduction, and displacement of wildlife and plant species can disrupt ecosystems, reduce biodiversity, and lead to the extinction of certain species.

But if i run it in API playground (with default settings except longer token length) I get a much more comprehensive answer:

https://platform.openai.com/playground/p/o9a5CO3QS6koRDN6CIAiY8Jj?model=gpt-3.5-turbo

Climate change is having a significant impact on the Earth's biodiversity. Here are some ways in which climate change is affecting our planet's biodiversity:

Habitat loss: Climate change alters temperature and rainfall patterns, leading to shifts in ecosystems. Many species rely on specific habitats for survival, and as these habitats change or disappear, they face the risk of extinction. Ecosystems such as coral reefs, polar regions, and tropical rainforests are particularly vulnerable.

Range shifts and species migration: With changing temperatures, species are forced to move to more suitable habitats. Some species may be unable to adapt or migrate fast enough, leading to reduced populations or local extinctions. This disrupts the delicate balance of ecosystems and can lead to the loss of certain species interactions.

Increased extinction risk: The combination of habitat loss, range shifts, and altered ecological interactions increases the risk of extinction for many species. Climate change acts as an additional stressor, which can push species already facing other threats to the brink of extinction.

Altered phenology and ecological mismatches: Climate change disrupts the timing of seasonal events, such as flowering, breeding, and migration. Many species rely on specific cues and timing to coordinate their life cycles and interactions. When these events are out of sync, it can lead to ecological mismatches, where species depend on resources that are no longer available or become vulnerable to predators without appropriate cover or food.

Coral bleaching and ocean acidification: Rising ocean temperatures and increased carbon dioxide levels in the atmosphere contribute to coral bleaching and ocean acidification. Coral reefs, which are one of the most diverse ecosystems on the planet, are particularly impacted, as the increased temperature and acidity stress corals, leading to their death and endangering the species that depend on them.

Disruption of food webs: Climate change can alter the availability and distribution of food sources, affecting the entire food web. For example, as Arctic sea ice declines, it reduces the hunting grounds for polar bears, disrupting their food chain. This can have cascading effects on other species in the ecosystem.

Overall, climate change is causing considerable disruption to our planet's biodiversity, threatening the survival of many species and ecosystems. Mitigating climate change and taking steps to preserve biodiversity are crucial for the long-term health of our planet.

yrvelez commented 1 year ago

I also commented on this. Even without reading any of the text, you can predict which one is GPT3.5 by simply selecting the shorter string. Research on persuasion finds that longer arguments generally tend to be seen as more persuasive and of higher quality. The ideal test would be two generations of similar length.

gblazex commented 1 year ago

"I should add that! Right now I tell the models to write as concisely as possible, max 1 paragraph"

There's too many moving variables. In a way we aren't just testing answer quality, but we are testing instruction following (be concise). And GPT 3.5 follows that instruction better. But then people will select the longer (more comprehensive) answer.

LLAMA wins in this case simply by following the instructions less closely.

Maybe ask each model to answer questions in 3 bullet points or something similar. That would be easier for each to follow in similar fashion I imagine.

cbh123 / llmboxing

I get much longer results from chatgpt in playground #2