mastouille.fr est l'un des nombreux serveurs Mastodon indépendants que vous pouvez utiliser pour participer au fédiverse.
Mastouille est une instance Mastodon durable, ouverte, et hébergée en France.

Administré par :

Statistiques du serveur :

575
comptes actifs

#dataprocessing

1 message1 participant0 message aujourd’hui

Going to other clubs: 😵😒🍹
Going to GBIF clubs: 😎💚🧬#DataUseClub

Registrations are open for our next Data Use Club practical session - Data standards & processing!

This session aims to give users the tools to understand the context of the data available on GBIF and help them decide if the data is fit for their analyses.

📍 Wed 12 Feb 2025 15:00-16:30 CET. This event will be recorded.

🔗 gbif.link/datauseclub-practica

🧐 In threat intelligence, you often have to deal with a bunch of different data sources, but these data can come in different forms and need to be processed before they can be analyzed!

Well, that’s exactly what you can do with Docling, an open-source library that allows you to process different kinds of data (PDF, DOCX, PPTX, XLSX, images, and more). Bonus: you can also use it for chunking your data, for example for a RAG.

One of the easiest ways to use it is as follows, where I pass a PDF report and convert it into JSON and Markdown, and now it is much easier to process further and extract additional details without polluting my pipeline with garbage data from the file structure.

And it supports OCR 🤓

You can find my code below 👇

➡️ github.com/DS4SD/docling/tree/

➡️ Code: gist.github.com/fr0gger/251cf8

👋 Hey All!

❓ Have you heard that since February Pydantic is not only an Open Source project but also a company that is based on the principles that have led to Pydantic's success?

🛣️ Well, today you have a chance shaping the roadmap for the Pydantic Inc.

📰 Read about the roadmap, ✅ take survey, 🔮 shape the future of Pydantic.

pydantic.dev/roadmap/

PydanticHelp us build our roadmap | PydanticWhat we're building, and how you can help plan it.

👋 Hey, everybody!

📑 While you were providing your feedback (🧵) the team behind Pydantic were busy with fixing bugs of cause and also they were updating the docs to make Pydantic v2 easy to migrate to and easy to start development with.

🫶 Over 2000 lines of just docs were changed since then (see the image). Kudos to everyone who is preparing Pydantic v2 to the final release!

Using Satellite Data For Species Distribution Modeling With GRASS GIS And R [video tutorial]
--
youtu.be/MLhrhUfPzZk <-- shared tutorial video
--
“Species distribution models (SDM) have traditionally used climatic data as predictors of habitat suitability for the target species. In this hands-on studio, ‘we’ will explore the use of satellite data to derive relevant predictors. ‘We’ will perform satellite data processing, from download to analysis, using GRASS GIS software functionality. Then, ‘we’ll’ read our predictors within R and perform SDM, visualize and analyze results. Finally, ‘we’ will write the output distribution maps back into GRASS…”
#GIS #spatial #mapping #spatialanalysis #tutorial #onlinelearning #software #video #R #SDM #speciesdistributionmodel #GRASS #model #modeling #remotesensing #satellite #predictors #dataprocessing #download #opendata #openaccess #opensource #visualisation #map