Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

large data via chunking/sharding/cloud #20

Open
vjcitn opened this issue Sep 5, 2024 · 3 comments
Open

large data via chunking/sharding/cloud #20

vjcitn opened this issue Sep 5, 2024 · 3 comments

Comments

@vjcitn
Copy link

vjcitn commented Sep 5, 2024

open software for large data analysis tasks leads to problems of compute strategy and large data conveyance how does your project deal with this?

Session Notes

@TomNicholas
Copy link

TomNicholas commented Sep 5, 2024

This is literally what Zarr is made for, which is a numfocus-sponsored project! I'm happy to talk about how we use Zarr for climate science and oceanography data, and point you towards others who use Zarr in biomedical applications.

EDIT:

compute strategy

If you're also wondering about parallel execution frameworks I run a Pangeo working group on this, and would be happy to talk about all our experiences doing very large (TB- and PB-scale) array computations in the cloud.

@vjcitn
Copy link
Author

vjcitn commented Sep 5, 2024

great!

@MSanKeys963 MSanKeys963 added unconference To designate a proposed unconference topic for the NumFOCUS 2024 Project Summit scheduled and removed unconference To designate a proposed unconference topic for the NumFOCUS 2024 Project Summit labels Sep 5, 2024
@InessaPawson
Copy link
Contributor

Thanks for proposing the topic, @vjcitn !

This unconference is scheduled in Clara Barton Room at 5 pm ET.

For full schedule, visit: https://www.nfsummit24.com/schedule.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants