hyanwong / giglib

MIT License
4 stars 2 forks source link

Numpy edge table data #116

Closed hyanwong closed 7 months ago

hyanwong commented 7 months ago

This stores (most) table data in large memory-contiguous numpy arrays.

Ragged data such as the individuals.parents column and metadata are hacked into a separate array (and not really tested)

Summary by CodeRabbit

coderabbitai[bot] commented 7 months ago

[!WARNING]

Rate Limit Exceeded

@hyanwong has exceeded the limit for the number of commits or files that can be reviewed per hour. Please wait 21 minutes and 26 seconds before requesting another review.

How to resolve this issue? After the wait time has elapsed, a review can be triggered using the `@coderabbitai review` command as a PR comment. Alternatively, push new commits to this PR. We recommend that you space out your commits to avoid hitting the rate limit.
How do rate limits work? CodeRabbit enforces hourly rate limits for each developer per organization. Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout. Please see our [FAQ](https://coderabbit.ai/docs/faq) for further information.
Commits Files that changed from the base of the PR and between fd0b0c536bd73e7c92d84cbcde2218dc8a19b9f4 and ef6a13817e64d961e2c77ead64d46a70e4f0e59b.

Walkthrough

The GeneticInheritanceGraphLibrary has undergone a transformation, focusing on optimization and efficiency. Key changes include the shift from dataclasses to namedtuple for better memory usage and performance, refinement of data handling and validation mechanisms across tables, and enhancements in test utilities for streamlined code testing. This update signifies a move towards more efficient data structures and streamlined codebase, reflecting a thoughtful reconsideration of the library's foundational elements.

Changes

Files Change Summary
.../graph.py Shifted from dataclasses to namedtuple; updated iedges property; modified Items and IEdge classes; redefined Node and Individual classes.
.../tables.py Major restructuring; switched to namedtuple; optimized data handling and validation.
tests/conftest.py Streamlined code by replacing direct calls with add_iedge function.
tests/gigutil.py, tests/test_gigutil.py Renamed and updated add_iedge; adjusted make_diploids logic; fixed ie._asdict() usage.
tests/test_graph.py, tests/test_tables.py Added new methods and tests; improved error handling and validation; optimized test assertions.

🐰✨

In the library's heart, where data weaves and bends,
A rabbit hopped through, making amends.
From dataclasses to namedtuple, it leapt,
Optimizing the code, so carefully kept.

🌟 In tests and tables, changes abound,
Efficiency and clarity, in leaps and bounds.
Celebrate the changes, for they're quite profound!
πŸΎπŸ’«

Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

Share - [X](https://twitter.com/intent/tweet?text=I%20just%20used%20%40coderabbitai%20for%20my%20code%20review%2C%20and%20it%27s%20fantastic%21%20It%27s%20free%20for%20OSS%20and%20offers%20a%20free%20trial%20for%20the%20proprietary%20code.%20Check%20it%20out%3A&url=https%3A//coderabbit.ai) - [Mastodon](https://mastodon.social/share?text=I%20just%20used%20%40coderabbitai%20for%20my%20code%20review%2C%20and%20it%27s%20fantastic%21%20It%27s%20free%20for%20OSS%20and%20offers%20a%20free%20trial%20for%20the%20proprietary%20code.%20Check%20it%20out%3A%20https%3A%2F%2Fcoderabbit.ai) - [Reddit](https://www.reddit.com/submit?title=Great%20tool%20for%20code%20review%20-%20CodeRabbit&text=I%20just%20used%20CodeRabbit%20for%20my%20code%20review%2C%20and%20it%27s%20fantastic%21%20It%27s%20free%20for%20OSS%20and%20offers%20a%20free%20trial%20for%20proprietary%20code.%20Check%20it%20out%3A%20https%3A//coderabbit.ai) - [LinkedIn](https://www.linkedin.com/sharing/share-offsite/?url=https%3A%2F%2Fcoderabbit.ai&mini=true&title=Great%20tool%20for%20code%20review%20-%20CodeRabbit&summary=I%20just%20used%20CodeRabbit%20for%20my%20code%20review%2C%20and%20it%27s%20fantastic%21%20It%27s%20free%20for%20OSS%20and%20offers%20a%20free%20trial%20for%20proprietary%20code)

Tips ### Chat There are 3 ways to chat with CodeRabbit: - Review comments: Directly reply to a review comment made by CodeRabbit. Example: - `I pushed a fix in commit .` - `Generate unit-tests for this file.` - `Open a follow-up GitHub issue for this discussion.` - Files and specific lines of code (under the "Files changed" tab): Tag `@coderabbitai` in a new review comment at the desired location with your query. Examples: - `@coderabbitai generate unit tests for this file.` - `@coderabbitai modularize this function.` - PR comments: Tag `@coderabbitai` in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples: - `@coderabbitai generate interesting stats about this repository and render them as a table.` - `@coderabbitai show all the console.log statements in this repository.` - `@coderabbitai read src/utils.ts and generate unit tests.` - `@coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.` Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments. ### CodeRabbit Commands (invoked as PR comments) - `@coderabbitai pause` to pause the reviews on a PR. - `@coderabbitai resume` to resume the paused reviews. - `@coderabbitai review` to trigger a review. This is useful when automatic reviews are disabled for the repository. - `@coderabbitai resolve` resolve all the CodeRabbit review comments. - `@coderabbitai help` to get help. Additionally, you can add `@coderabbitai ignore` anywhere in the PR description to prevent this PR from being reviewed. ### CodeRabbit Configration File (`.coderabbit.yaml`) - You can programmatically configure CodeRabbit by adding a `.coderabbit.yaml` file to the root of your repository. - The JSON schema for the configuration file is available [here](https://coderabbit.ai/integrations/coderabbit-overrides.v2.json). - If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: `# yaml-language-server: $schema=https://coderabbit.ai/integrations/coderabbit-overrides.v2.json` ### CodeRabbit Discord Community Join our [Discord Community](https://discord.com/invite/GsXnASn26c) to get help, request features, and share feedback.
hyanwong commented 7 months ago

Probably worth merging, as it actually cuts the number of lines down a little (even though it does make the simulations at the scale I am using about 1% slower. Given that using numpy arrays opens up the possibility of using number, I think we should just merge this.