CelebA

E431002

benchmark dataset face attributes dataset image dataset

CelebA is a large-scale face attributes dataset widely used in computer vision research for tasks like facial recognition, attribute prediction, and generative modeling.

Try in SPARQL Jump to: Surface forms Statements Referenced by

All labels observed (2)

Label	Occurrences
CelebA canonical	1
CelebA-HQ	1

Statements (72)

Predicate	Object
instanceOf	benchmark dataset ⓘ face attributes dataset ⓘ image dataset ⓘ
contains	celebrity face images ⓘ face images ⓘ
domain	computer vision ⓘ
hasApproximateNumberOfImages	200000 ⓘ
hasAttribute	5 o Clock Shadow ⓘ Arched Eyebrows ⓘ Attractive ⓘ Bags Under Eyes ⓘ Bald ⓘ Bangs ⓘ Big Lips ⓘ Big Nose ⓘ Black Hair ⓘ Blond Hair ⓘ Brown Hair ⓘ Bushy Eyebrows ⓘ Chubby ⓘ Double Chin ⓘ Eyeglasses ⓘ Goatee ⓘ Gray Hair ⓘ Heavy Makeup ⓘ High Cheekbones ⓘ Male ⓘ Mouth Slightly Open ⓘ Mustache ⓘ Narrow Eyes ⓘ No Beard ⓘ No Eyewear ⓘ Oval Face ⓘ Pale Skin ⓘ Pointy Nose ⓘ Receding Hairline ⓘ Rosy Cheeks ⓘ Sideburns ⓘ Smiling ⓘ Straight Hair ⓘ Wavy Hair ⓘ Wearing Earrings ⓘ Wearing Hat ⓘ Wearing Lipstick ⓘ Wearing Necklace ⓘ Wearing Necktie ⓘ Young ⓘ Young vs Old ⓘ
hasAttributeType	binary facial attributes ⓘ
hasDataSplit	test set ⓘ training set ⓘ validation set ⓘ
hasImageResolution	178x218 ⓘ
hasLicense	research only ⓘ
hasNumberOfAttributes	40 ⓘ
hasNumberOfIdentities	10000 ⓘ
hasProvider	The Chinese University of Hong Kong NERFINISHED ⓘ
isPopularBenchmarkFor	face attribute classification ⓘ face editing ⓘ identity-preserving generation ⓘ
relatedDataset	CelebA-HQ NERFINISHED ⓘ CelebAMask-HQ NERFINISHED ⓘ
usedFor	GAN training ⓘ attribute-conditioned image generation ⓘ domain adaptation in vision ⓘ face detection research ⓘ face recognition research ⓘ facial attribute prediction ⓘ fairness analysis in face recognition ⓘ generative modeling of faces ⓘ image-to-image translation ⓘ representation learning ⓘ

How these facts were elicited

The pipeline generated the facts above by prompting gpt-5.1 with this entity's name + description and the instruction below.

Instruction

You are a knowledge base construction expert. Given a subject entity and a description of it, return factual statements that you know for the subject as a JSON list of dictionaries(triples), where keys must be "subject", "predicate" and "object". The number of facts may be very high, between 25 to 50 or more, for very popular subjects. For less popular subjects, the number of facts can be very low, like 5 or 10.

# Requirements
- If you don't know the subject at all, return an empty list.
- If the subject is not a named entity, return an empty list.
- Include at least one triple where predicate is "instanceOf".
- Do not get too wordy.
- Separate several objects into multiple triples with one object.

Input

Subject: CelebA
Description of subject: CelebA is a large-scale face attributes dataset widely used in computer vision research for tasks like facial recognition, attribute prediction, and generative modeling.

Referenced by (2)

Full triples — surface form annotated when it differs from this entity's canonical label.

torchvision (ecosystem) → dataset → CelebA ⓘ

subject surface form: torchvision

StyleGAN → trainingDataset → CelebA ⓘ

this entity surface form: CelebA-HQ