import contextlib
import functools
import warnings
from typing import Callable, Optional

import torch
from torch._library.utils import Kernel, RegistrationHandle
S )AbstractImplHolderz4A holder where one can register an abstract impl to.qualnamec                 C   s   || _ d | _d | _d S N)r   kernellib)selfr    r   GC:\wamp64\www\opt\env\Lib\site-packages\torch/_library/abstract_impl.py__init__   s   
zAbstractImplHolder.__init__funcsourcereturnc                    s    j durtd j d j j dtj jdr$td j dtj jdr5td j dt|| _  jdu rP j	d	d
 }tj
|d _t j } j j|d  fdd}t|S )zRegister an abstract impl.

        Returns a RegistrationHandle that one can use to de-register this
        abstract impl.
        Nz!impl_abstract(...): the operator z, already has an abstract impl registered at .ZMetaz already has an DispatchKey::Meta implementation via a pre-existing torch.library or TORCH_LIBRARY registration. Please either remove that registration or don't call impl_abstract.ZCompositeImplicitAutograda-   already has an implementation for this device type via a pre-existing registration to DispatchKey::CompositeImplicitAutograd.CompositeImplicitAutograd operators do not need an abstract impl; instead, the operator will decompose into its constituents and those can have abstract impls defined on them.z::r   ZFRAGMENTc                      s     j r j   d  _ d  _d S r   )r
   Z_destroyr	   r   r   r   r   deregister_abstract_impl@   s   

z=AbstractImplHolder.register.<locals>.deregister_abstract_impl)r	   RuntimeErrorr   r   torchZ_CZ%_dispatch_has_kernel_for_dispatch_keyr   r
   splitZlibraryLibraryconstruct_meta_kernelimplr   )r   r   r   nsmeta_kernelr   r   r   r   register   s0   


zAbstractImplHolder.registerN)	__name__
__module____qualname____doc__strr   r   r   r   r   r   r   r   r   
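
# Illustrative usage sketch (hypothetical names; "mylib::my_op" and
# my_abstract_fn are placeholders, not part of this module). Registration
# hands back a handle whose destroy() undoes it:
#
#     holder = AbstractImplHolder("mylib::my_op")
#     handle = holder.register(my_abstract_fn, source="mymodule.py:10")
#     # ... the Meta kernel for mylib::my_op now calls my_abstract_fn ...
#     handle.destroy()  # removes the kernel and tears down the FRAGMENT Library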

def construct_meta_kernel(
    qualname: str, abstract_impl_holder: AbstractImplHolder
) -> Callable:
    assert abstract_impl_holder.kernel is not None

    @functools.wraps(abstract_impl_holder.kernel.func)
    def meta_kernel(*args, **kwargs):
        assert abstract_impl_holder.kernel is not None
        source = abstract_impl_holder.kernel.source

        def error_on_ctx():
            raise RuntimeError(
                f"Attempted to call get_ctx() for the meta implementation "
                f"for {qualname} (implemented at {source}). "
                f"You have presumably called get_ctx() because the operator "
                f"has a data-dependent output shape; if so, there is no "
                f"such meta implementation and this error is the correct "
                f"behavior."
            )

        with set_ctx_getter(error_on_ctx):
            return abstract_impl_holder.kernel(*args, **kwargs)

    return meta_kernel


def get_none():
    return None


global_ctx_getter: Callable = get_none
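
# Sketch of the intent (my_abstract_fn is a hypothetical user kernel):
# meta_kernel installs error_on_ctx for the duration of the call, so an
# abstract impl that reaches for a context object while running under the
# plain Meta key fails loudly instead of silently misbehaving, since
# torch.library.get_ctx() resolves through global_ctx_getter above:
#
#     def my_abstract_fn(x):
#         ctx = torch.library.get_ctx()  # raises RuntimeError under Meta
#         ...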

@contextlib.contextmanager
def set_ctx_getter(ctx_getter):
    global global_ctx_getter
    prev = global_ctx_getter
    try:
        global_ctx_getter = ctx_getter
        yield
    finally:
        global_ctx_getter = prev
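
# Behavior sketch (illustrative; some_ctx is a hypothetical object):
# set_ctx_getter swaps the global getter for the duration of the `with`
# block and restores the previous one even if the body raises:
#
#     with set_ctx_getter(lambda: some_ctx):
#         ...  # code here sees some_ctx via global_ctx_getter()
#     # afterwards global_ctx_getter is back to its previous value
#     # (get_none by default)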
dddejfddZdS )AbstractImplCtxzS
    Context object for writing abstract implementations for custom operators.
    c                 C   s   || _ || _d S r   )
_shape_env_op)r   r/   r0   r   r   r   r   z   s   
zAbstractImplCtx.__init__   Nminmaxr   c                C   s   t d | j||dS )NzIcreate_unbacked_symint is deprecated, please use new_dynamic_size insteadr2   )warningswarnnew_dynamic_size)r   r3   r4   r   r   r   create_unbacked_symint~   s   z&AbstractImplCtx.create_unbacked_symintr   c                C   s   | j du s	| j jstjj| jt|tjst|tjr(t	d| d| d|dk r4t	d| d| j 
 }tjjjj|||d |S )a=	  Constructs a new symint (symbolic int) representing a data-dependent value.

        This is useful for writing the abstract implementation (which is necessary
        for torch.compile) for a CustomOp where an output Tensor has a size
        that depends on the data of the input Tensors.

        Args:
            min (int): A statically known inclusive lower bound for this symint. Default: 0
            max (Optional[int]): A statically known inclusive upper bound for this
                symint. Default: None

        .. warning::

            It is important that the ``min`` and ``max`` (if not None) values are
            set correctly; otherwise, there will be undefined behavior under
            torch.compile. Note that torch.compile specializes on 0/1 sizes, so
            pick ``min >= 2`` whenever sizes of 0 or 1 are impossible at runtime.

            You must also verify that your implementation on concrete Tensors
            (e.g. CPU/CUDA) only returns Tensors where the size that corresponds
            to the symint also respects these constraints.
            The easiest way to do this is to add an assertion in the CPU/CUDA/etc
            implementation that the size follows these bounds.

        Example::

            >>> import numpy as np
            >>>
            >>> # An operator with data-dependent output shape
            >>> lib = torch.library.Library("mymodule", "FRAGMENT")
            >>> lib.define("mymodule::custom_nonzero(Tensor x) -> Tensor")
            >>>
            >>> @torch.library.impl_abstract("mymodule::custom_nonzero")
            >>> def custom_nonzero_abstract(x):
            >>>     # The number of nonzero elements is data-dependent.
            >>>     # Since we cannot peek at the data in an abstract impl,
            >>>     # we use the ctx object to construct a new symint that
            >>>     # represents the data-dependent size.
            >>>     ctx = torch.library.get_ctx()
            >>>     nnz = ctx.new_dynamic_size()
            >>>     shape = [nnz, x.dim()]
            >>>     result = x.new_empty(shape, dtype=torch.int64)
            >>>     return result
            >>>
            >>> @torch.library.impl(lib, "custom_nonzero", "CPU")
            >>> def custom_nonzero_cpu(x):
            >>>     x_np = x.numpy()
            >>>     res = np.stack(np.nonzero(x_np), axis=1)
            >>>     return torch.tensor(res, device=x.device)

        Nzctx.new_dynamic_size(min=z, max=zZ): expected min and max to be statically known ints but got SymInt. This is not supported.r   zc, ...): expected min to be greater than or equal to 0: this API can only create non-negative sizes.r2   )r/   Zallow_dynamic_output_shape_opsr   Z_subclassesZfake_tensorZDynamicOutputShapeExceptionr0   
isinstanceSymInt
ValueErrorr8   ZfxZexperimentalZsymbolic_shapesZ_constrain_range_for_size)r   r3   r4   resultr   r   r   r7      s"   
3


z AbstractImplCtx.new_dynamic_size)	r   r   r    r!   r   r   r:   r8   r7   r   r   r   r   r.   u   s
    r.   )
contextlibr)   r5   typingr   r   r   Ztorch._library.utilsr   r   r   r"   r   r+   r,   __annotations__contextmanagerr%   r.   r   r   r   r   <module>   s&   
 ?


