stable_datasets.features package
Submodules
stable_datasets.features.array module
Array-based feature codecs.
- class Array3D(shape: tuple, dtype: str = 'uint8')[source]
Bases:
FeatureTypeFixed-shape 3D array stored as flat bytes.
- format(value, *, format_type: str, decode_images: bool = True, cache_dir: Path | None = None)[source]
- to_arrow_type() DataType[source]
stable_datasets.features.base module
Core feature descriptors shared across modalities.
- class ClassLabel(names: list[str] | None = None, num_classes: int | None = None)[source]
Bases:
FeatureTypeCategorical label with name-to-int mapping.
- encode(value, *, cache_dir: Path | None = None)[source]
- format(value, *, format_type: str, decode_images: bool = True, cache_dir: Path | None = None)[source]
- int2str(idx: int) str[source]
- str2int(name: str) int[source]
- to_arrow_type() DataType[source]
- class FeatureType[source]
Bases:
objectBase class for feature type descriptors.
- encode(value, *, cache_dir: Path | None = None)[source]
- fingerprint_data() str[source]
- format(value, *, format_type: str, decode_images: bool = True, cache_dir: Path | None = None)[source]
- to_arrow_type() DataType[source]
- class Sequence(feature: FeatureType)[source]
Bases:
FeatureTypeVariable-length list of a sub-feature.
- encode(value, *, cache_dir: Path | None = None)[source]
- to_arrow_type() DataType[source]
- class Value(dtype: str)[source]
Bases:
FeatureTypeScalar value type. Maps dtype strings to PyArrow types.
- format(value, *, format_type: str, decode_images: bool = True, cache_dir: Path | None = None)[source]
- to_arrow_type() DataType[source]
stable_datasets.features.image module
Image feature codec.
- class Image(encode_format: str = 'PNG')[source]
Bases:
FeatureTypeImage feature stored as raw bytes in Arrow.
- format(value, *, format_type: str, decode_images: bool = True, cache_dir: Path | None = None)[source]
- to_arrow_type()[source]
stable_datasets.features.video module
Video feature codec and lazy reference objects.
- class Video(storage: str = 'path', allowed_extensions: tuple[str, ...] = ('.mp4', '.avi', '.mov', '.webm', '.mkv'))[source]
Bases:
FeatureTypeVideo feature with validated path, bytes, or specialized frame storage.
- encode(value, *, cache_dir: Path | None = None)[source]
- fingerprint_data() str[source]
- format(value, *, format_type: str, decode_images: bool = True, cache_dir: Path | None = None)[source]
- to_arrow_type() DataType[source]
Module contents
Feature codec modules.
- class Array3D(shape: tuple, dtype: str = 'uint8')[source]
Bases:
FeatureTypeFixed-shape 3D array stored as flat bytes.
- format(value, *, format_type: str, decode_images: bool = True, cache_dir: Path | None = None)[source]
- to_arrow_type() DataType[source]
- class ClassLabel(names: list[str] | None = None, num_classes: int | None = None)[source]
Bases:
FeatureTypeCategorical label with name-to-int mapping.
- encode(value, *, cache_dir: Path | None = None)[source]
- format(value, *, format_type: str, decode_images: bool = True, cache_dir: Path | None = None)[source]
- int2str(idx: int) str[source]
- str2int(name: str) int[source]
- to_arrow_type() DataType[source]
- class FeatureType[source]
Bases:
objectBase class for feature type descriptors.
- encode(value, *, cache_dir: Path | None = None)[source]
- fingerprint_data() str[source]
- format(value, *, format_type: str, decode_images: bool = True, cache_dir: Path | None = None)[source]
- to_arrow_type() DataType[source]
- class Image(encode_format: str = 'PNG')[source]
Bases:
FeatureTypeImage feature stored as raw bytes in Arrow.
- format(value, *, format_type: str, decode_images: bool = True, cache_dir: Path | None = None)[source]
- to_arrow_type()[source]
- class Sequence(feature: FeatureType)[source]
Bases:
FeatureTypeVariable-length list of a sub-feature.
- encode(value, *, cache_dir: Path | None = None)[source]
- to_arrow_type() DataType[source]
- class Value(dtype: str)[source]
Bases:
FeatureTypeScalar value type. Maps dtype strings to PyArrow types.
- format(value, *, format_type: str, decode_images: bool = True, cache_dir: Path | None = None)[source]
- to_arrow_type() DataType[source]
- class Video(storage: str = 'path', allowed_extensions: tuple[str, ...] = ('.mp4', '.avi', '.mov', '.webm', '.mkv'))[source]
Bases:
FeatureTypeVideo feature with validated path, bytes, or specialized frame storage.
- encode(value, *, cache_dir: Path | None = None)[source]
- fingerprint_data() str[source]
- format(value, *, format_type: str, decode_images: bool = True, cache_dir: Path | None = None)[source]
- to_arrow_type() DataType[source]