o ‘id5d6„Z-eehd7£ƒge)edgd'geeddd(d)dgd'ge*edƒgd'geeddd*d)geed+dd(d)gd8œ dd-d9ddd.d$e+d/d0ƒe+d1d2ƒfdddd8œ d:d;„ƒZ.dS)?zÞLabeled Faces in the Wild (LFW) dataset This dataset is a collection of JPEG pictures of famous people collected over the internet, all details are available on the official website: http://vis-www.cs.umass.edu/lfw/ éN)ÚIntegralÚReal)ÚPathLikeÚlistdirÚmakedirsÚremove)ÚexistsÚisdirÚjoin)ÚMemoryé)ÚBunch)ÚHiddenÚIntervalÚ StrOptionsÚvalidate_params)Útarfile_extractallé)ÚRemoteFileMetadataÚ _fetch_remoteÚ get_data_homeÚ load_descrzlfw.tgzz.https://ndownloader.figshare.com/files/5976018Z@055f7d9c632d7370e6fb4afc7468d40f970c34a80d4c6f50ffec63f5a8d536c0)ÚfilenameÚurlZchecksumzlfw-funneled.tgzz.https://ndownloader.figshare.com/files/5976015Z@b47c8422c8cded889dc5a13418c4bc2abbda121092b3533a83306f90d900100aúpairsDevTrain.txtz.https://ndownloader.figshare.com/files/5976012Z@1d454dada7dfeca0e7eab6f65dc4e97a6312d44cf142207be28d688be92aabfaúpairsDevTest.txtz.https://ndownloader.figshare.com/files/5976009Z@7cb06600ea8b2814ac26e946201cdb304296262aad67d046a16a7ec85d0ff87cú pairs.txtz.https://ndownloader.figshare.com/files/5976006Z@ea42330c62c92989f9d7c03237ed5d591365e89b3e649747777b70e692dc1592Téçð?c Cs<t|d}t|dƒ}t|ƒst|ƒtD]$}t||jƒ}t|ƒs8|r2t d|j¡t ||||dqt d|ƒ‚q|rCt|dƒ}t} nt|dƒ}t} t|ƒsšt|| jƒ} t| ƒsp|rjt d| j¡t | |||dnt d| ƒ‚d d l }t d|¡| | d¡}t||d Wd ƒn1s‘wYt| ƒ||fS)z0Helper function to download any missing LFW data)Ú data_homeÚlfw_homezDownloading LFW metadata: %s)ÚdirnameÚ n_retriesÚdelayz %s is missingZlfw_funneledZlfwz!Downloading LFW data (~200MB): %srNz$Decompressing the data archive to %szr:gz)Úpath)rr rrÚTARGETSrÚloggerÚinforrÚOSErrorÚFUNNELED_ARCHIVEÚARCHIVEÚtarfileÚdebugÚopenrr) rÚfunneledÚdownload_if_missingr"r#r ÚtargetZtarget_filepathÚdata_folder_pathÚarchiveÚarchive_pathr+Úfp©r5úd/var/www/html/eduruby.in/lip-sync/lip-sync-env/lib/python3.10/site-packages/sklearn/datasets/_lfw.pyÚ_check_fetch_lfwMsF ÿù 
ÿÿr7cCs²zddlm}Wntytdƒ‚wtddƒtddƒf}|dur%|}ntdd„t||ƒDƒƒ}|\}}|j|j|jp>d}|j|j|jpId} |dur_t |ƒ}t ||ƒ}t || ƒ} t|ƒ} |sqtj | || ftjd }ntj | || d ftjd }t|ƒD]U\}} |ddkr”t d|d| ¡| | ¡}| |j|j|j|jf¡}|dur±| | |f¡}tj|tjd }|jdkrÄtd | ƒ‚|d}|sÐ|jdd}|||df<q|S)zInternally used to load imagesr)ÚImagez¨The Python Imaging Library (PIL) is required to load data from jpeg files. Please refer to https://pillow.readthedocs.io/en/stable/installation.html for installing PIL.éúNcss|] \}}|p |VqdS)Nr5)Ú.0ÚsZdsr5r5r6Ú ’s€z_load_imgs..r©ZdtyperièzLoading face #%05d / %05dzLFailed to read the image file %s, Please make sure that libjpeg is installedgào@r)Zaxis.)ZPILr8ÚImportErrorÚsliceÚtupleÚzipÚstopÚstartÚstepÚfloatÚintÚlenÚnpÚzerosZfloat32Ú enumerater&r,r-ÚcropÚresizeZasarrayÚndimÚRuntimeErrorÚmean)Ú file_pathsÚslice_ÚcolorrLr8Z default_sliceZh_sliceZw_sliceÚhÚwÚn_facesÚfacesÚiÚ file_pathZpil_imgZfacer5r5r6Ú _load_imgs€sVÿÿ ÿ ÿÿrYFcsøgg}}tt|ƒƒD]4}t||ƒ‰tˆƒsq‡fdd„ttˆƒƒDƒ}t|ƒ} | |kr?| dd¡}| |g| ¡| |¡qt|ƒ} | dkrNtd|ƒ‚t |¡}t ||¡}t||||ƒ} t | ¡}tj d¡ |¡| |||} }| ||fS)z~Perform the actual data loading for the lfw people dataset This operation is meant to be cached by a joblib wrapper. csg|]}tˆ|ƒ‘qSr5)r )r:Úf©Zfolder_pathr5r6Ú Øsz%_fetch_lfw_people..Ú_ú rz*min_faces_per_person=%d is too restrictiveé*)Úsortedrr r rGÚreplaceÚextendÚ ValueErrorrHÚuniqueZsearchsortedrYZarangeÚrandomZRandomStateÚshuffle)r1rQrRrLÚmin_faces_per_personZperson_namesrPZperson_nameÚpathsZ n_picturesrUÚtarget_namesr0rVÚindicesr5r[r6Ú_fetch_lfw_peopleÊs0 €ÿ rkÚbooleanZneither)ÚclosedÚleftg) rr.rLrgrRrQr/Ú return_X_yr"r#)Zprefer_skip_nested_validationgà?éFéÃéNé¬c Csˆt||||| d\} }t d| ¡t| ddd}| t¡} | |||||d\}}}| t|ƒd¡}tdƒ}|r;||fSt |||||d S) a|Load the Labeled Faces in the Wild (LFW) people dataset (classification). Download it if necessary. 
    =================   =======================
    Classes                                5749
    Samples total                         13233
    Dimensionality                         5828
    Features            real, between 0 and 255
    =================   =======================

    For a usage example of this dataset, see
    :ref:`sphx_glr_auto_examples_applications_plot_face_recognition.py`.

    Read more in the :ref:`User Guide `.

    Parameters
    ----------
    data_home : str or path-like, default=None
        Specify another download and cache folder for the datasets. By default
        all scikit-learn data is stored in '~/scikit_learn_data' subfolders.

    funneled : bool, default=True
        Download and use the funneled variant of the dataset.

    resize : float or None, default=0.5
        Ratio used to resize each face picture. If `None`, no resizing is
        performed.

    min_faces_per_person : int, default=None
        The extracted dataset will only retain pictures of people that have at
        least `min_faces_per_person` different pictures.

    color : bool, default=False
        Keep the 3 RGB channels instead of averaging them to a single
        gray level channel. If color is True, the shape of the data has
        one more dimension than the shape with color = False.

    slice_ : tuple of slice, default=(slice(70, 195), slice(78, 172))
        Provide a custom 2D slice (height, width) to extract the
        'interesting' part of the jpeg files and avoid using statistical
        correlations from the background.

    download_if_missing : bool, default=True
        If False, raise an OSError if the data is not locally available
        instead of trying to download the data from the source site.

    return_X_y : bool, default=False
        If True, returns ``(dataset.data, dataset.target)`` instead of a Bunch
        object. See below for more information about the `dataset.data` and
        `dataset.target` objects.

        .. versionadded:: 0.20

    n_retries : int, default=3
        Number of retries when HTTP errors are encountered.

        .. versionadded:: 1.5

    delay : float, default=1.0
        Number of seconds between retries.

        .. versionadded:: 1.5

    Returns
    -------
    dataset : :class:`~sklearn.utils.Bunch`
        Dictionary-like object, with the following attributes.

data : numpy array of shape (13233, 2914) Each row corresponds to a ravelled face image of original size 62 x 47 pixels. Changing the ``slice_`` or resize parameters will change the shape of the output. images : numpy array of shape (13233, 62, 47) Each row is a face image corresponding to one of the 5749 people in the dataset. Changing the ``slice_`` or resize parameters will change the shape of the output. target : numpy array of shape (13233,) Labels associated to each face image. Those labels range from 0-5748 and correspond to the person IDs. target_names : numpy array of shape (5749,) Names of all persons in the dataset. Position in array corresponds to the person ID in the target array. DESCR : str Description of the Labeled Faces in the Wild (LFW) dataset. (data, target) : tuple if ``return_X_y`` is True A tuple of two ndarray. The first containing a 2D array of shape (n_samples, n_features) with each row representing one sample and each column representing the features. The second ndarray of shape (n_samples,) containing the target samples. .. versionadded:: 0.20 Examples -------- >>> from sklearn.datasets import fetch_lfw_people >>> lfw_people = fetch_lfw_people() >>> lfw_people.data.shape (13233, 2914) >>> lfw_people.target.shape (13233,) >>> for name in lfw_people.target_names[:5]: ... print(name) AJ Cook AJ Lamas Aaron Eckhart Aaron Guiel Aaron Patterson ©rr.r/r"r#z Loading LFW people faces from %sér©ÚlocationÚcompressÚverbose)rLrgrRrQéÿÿÿÿúlfw.rst)ÚdataZimagesr0riÚDESCR) r7r&r,rÚcacherkÚreshaperGrr )rr.rLrgrRrQr/ror"r#r r1ÚmÚ load_funcrVr0riÚXÚfdescrr5r5r6Úfetch_lfw_peopleõs2 û û ÿr„c CsÜt|dƒ}dd„|Dƒ}Wdƒn1swYdd„|Dƒ}t|ƒ}tj|td} tƒ} t|ƒD]Œ\}}t|ƒdkr\d| |<|d t|dƒdf|d t|d ƒdff} n-t|ƒdkrd | |<|d t|dƒdf|d t|dƒdff} n td|d|fƒ‚t| ƒD]3\}\}}zt||ƒ}Wnt y«t|t |d ƒƒ}Ynwttt|ƒƒƒ}t|||ƒ}| |¡qq5t| |||ƒ}t|jƒ}| d ¡}| d d ¡| d |d ¡||_|| t ddg¡fS)z}Perform the actual data loading for the LFW pairs dataset This operation is meant to be cached by a joblib wrapper. 
ÚrbcSsg|]}| ¡ ¡ d¡‘qS)ú )ÚdecodeÚstripÚsplit)r:Úlnr5r5r6r\´sz$_fetch_lfw_pairs..NcSsg|] }t|ƒdkr|‘qS)r)rG)r:Úslr5r5r6r\µsr=rrrrézinvalid line %d: %rzUTF-8zDifferent personszSame person)r-rGrHrIrFÚlistrJrcr Ú TypeErrorÚstrr`rÚappendrYÚshapeÚpopÚinsertÚarray)Úindex_file_pathr1rQrRrLZ index_fileÚsplit_linesZ pair_specsZn_pairsr0rPrWÚ componentsÚpairÚjÚnameÚidxZ person_folderÚ filenamesrXÚpairsr‘rUr5r5r6Ú_fetch_lfw_pairsªsH ÿþþÿù rž>Ú10_foldsÚtrainÚtest) Úsubsetrr.rLrRrQr/r"r#r c Cs¸t|||||d\} } t d|| ¡t| ddd}| t¡}dddd œ} || vr6td |tt| ¡ƒƒfƒ‚t | | |ƒ}||| |||d\}}}tdƒ}t| t|ƒd ¡||||dS)awLoad the Labeled Faces in the Wild (LFW) pairs dataset (classification). Download it if necessary. ================= ======================= Classes 2 Samples total 13233 Dimensionality 5828 Features real, between 0 and 255 ================= ======================= In the `original paper `_ the "pairs" version corresponds to the "restricted task", where the experimenter should not use the name of a person to infer the equivalence or non-equivalence of two face images that are not explicitly given in the training set. The original images are 250 x 250 pixels, but the default slice and resize arguments reduce them to 62 x 47. Read more in the :ref:`User Guide `. Parameters ---------- subset : {'train', 'test', '10_folds'}, default='train' Select the dataset to load: 'train' for the development training set, 'test' for the development test set, and '10_folds' for the official evaluation set that is meant to be used with a 10-folds cross validation. data_home : str or path-like, default=None Specify another download and cache folder for the datasets. By default all scikit-learn data is stored in '~/scikit_learn_data' subfolders. funneled : bool, default=True Download and use the funneled variant of the dataset. resize : float, default=0.5 Ratio used to resize the each face picture. color : bool, default=False Keep the 3 RGB channels instead of averaging them to a single gray level channel. 
        If color is True, the shape of the data has one more dimension
        than the shape with color = False.

    slice_ : tuple of slice, default=(slice(70, 195), slice(78, 172))
        Provide a custom 2D slice (height, width) to extract the
        'interesting' part of the jpeg files and avoid using statistical
        correlations from the background.

    download_if_missing : bool, default=True
        If False, raise an OSError if the data is not locally available
        instead of trying to download the data from the source site.

    n_retries : int, default=3
        Number of retries when HTTP errors are encountered.

        .. versionadded:: 1.5

    delay : float, default=1.0
        Number of seconds between retries.

        .. versionadded:: 1.5

    Returns
    -------
    data : :class:`~sklearn.utils.Bunch`
        Dictionary-like object, with the following attributes.

        data : ndarray of shape (2200, 5828). Shape depends on ``subset``.
            Each row corresponds to 2 raveled face images
            of original size 62 x 47 pixels.
            Changing the ``slice_``, ``resize`` or ``subset`` parameters
            will change the shape of the output.
        pairs : ndarray of shape (2200, 2, 62, 47). Shape depends on ``subset``.
            Each row has 2 face images corresponding
            to the same or different persons from the dataset
            containing 5749 people. Changing the ``slice_``,
            ``resize`` or ``subset`` parameters will change the shape of the
            output.
        target : numpy array of shape (2200,). Shape depends on ``subset``.
            Labels associated with each pair of images.
            The two label values are "Different persons" and "Same person".
        target_names : numpy array of shape (2,)
            Explains the target values of the target array.
            0 corresponds to "Different persons", 1 corresponds to "Same person".
        DESCR : str
            Description of the Labeled Faces in the Wild (LFW) dataset.

Examples -------- >>> from sklearn.datasets import fetch_lfw_pairs >>> lfw_pairs_train = fetch_lfw_pairs(subset='train') >>> list(lfw_pairs_train.target_names) [np.str_('Different persons'), np.str_('Same person')] >>> lfw_pairs_train.pairs.shape (2200, 2, 62, 47) >>> lfw_pairs_train.data.shape (2200, 5828) >>> lfw_pairs_train.target.shape (2200,) rtzLoading %s LFW pairs from %srurrvrrr)r r¡rŸz+subset='%s' is invalid: should be one of %r)rLrRrQr{rz)r|rr0rir})r7r&r,rr~ržrcrr`Úkeysr rr rrG)r¢rr.rLrRrQr/r"r#r r1r€rZlabel_filenamesr•rr0rirƒr5r5r6Úfetch_lfw_pairsÞsB û ýÿÿ ÿûr¤)NTTrr)NFNr)NFN)/Ú__doc__ÚloggingÚnumbersrrÚosrrrrZos.pathrr r ÚnumpyrHZjoblibrÚutilsr Zutils._param_validationrrrrZutils.fixesrÚ_baserrrrÚ getLoggerÚ__name__r&r*r)r%r7rYrkrr@r?r„ržr¤r5r5r5r6Ús¼ ýýýýýõ ÿ3K ÿ+ öóõ( ÿ4 ÷ôö