📌 Marque-pages Pinboard

← Retour à tous les marque-pages
Réinitialiser
Recherche en cours...
36 résultats (1-36 marque-pages affichés)
docs.google.com
docs.google.com
infolingu.univ-mlv.fr
github.com
Lexique 3 est une base de données qui fournit pour 135000 mots du français de nombreuses informations linguistiques ici au format SQL - WhiteFangs/lexique.sql
www.backblaze.com
Hard Drive test data from the Backblaze data center. Backblaze is affordable, easy-to-use cloud storage.
www.ipdb.org
isbndb.com
The ISBNdb book database includes over 31 million unique ISBNs with up to 19 data points per book and is searchable via API. Retrieve book information from ISBN database on the go!
www.datacomp.ai
In search of the next generation of multimodal datasets
public.work
Public Work is a visual search engine for public domain content. Explore 100,000+ copyright-free images from The MET, New York Public Library, and other sources
datasets.doctrine.fr
cinescale.github.io
Cinematic Features
cldr.unicode.org
genius.com
Genius is the world’s biggest collection of song lyrics and musical knowledge.
www.openvc.app
Raise from 5,000+ investors. For free.
cldr.unicode.org
datasets.imdbws.com
data.culture.gouv.fr
Données ouvertes (Open data) mises à disposition par le MinistÚre de la Culture et ses établissements publics.
www.pop.culture.gouv.fr
POP propose de faire des données patrimoniales un bien commun, en rendant accessibles et consultables plus de 3 millions de contenus numériques du patrimoine français.
donnees.banquemondiale.org
World Bank Open Data from The World Bank: Data
cloud.google.com
dataplane.org
Dataplane.org - For operators, by operators.
huggingface.co
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
en.wikipedia.org
github.com
One place for all the default credentials to assist the Blue/Red teamers activities on finding devices with default password đŸ›Ąïž - ihebski/DefaultCreds-cheat-sheet: One place for all the default credentials to assist the Blue/Red teamers activities on finding devices with default password đŸ›Ąïž
github.com
Label, clean and enrich text datasets with Large Language Models (LLMs) - refuel-ai/autolabel: Label, clean and enrich text datasets with Large Language Models (LLMs)
lowbackgroundsteel.ai
Sources of training data that haven’t been contaminated by AI-created content. Low Background Steel (and lead) is a type of metal uncontaminated by radioactive isopes from nuclear testing. That steel...
www.data.gouv.fr
data.gouv.fr accueil
api.gouv.fr
API Sirene est une des APIs du service public. Accéder aux informations concernant les entreprises et les établissements immatriculés au répertoire

github.com
Downloadable snapshots of the Chrome Top Million Websites pulled from public CrUX data in BigQuery. - zakird/crux-top-lists: Downloadable snapshots of the Chrome Top Million Websites pulled from public CrUX data in BigQuery.
exposing.ai
Exposing.ai
old.reddit.com
I've extracted some tables (14 485) from Statement of Financial Affairs. Might be useful for someone, or maybe someone will make some charts?...
www.courdecassation.fr
laion.ai
rated, large-scale datasets crawled from publically available internet. Our recommendation is therefore to use the dataset for research purposes. Be aware that this large-scale dataset is uncurated. Keep in mind that the uncurated nature of the dataset means that collected links may lead to stron
laion.ai