-
Notifications
You must be signed in to change notification settings - Fork 615
Issues
is:issue state:open
is:issue state:open
Issue creation is restricted in this repository
Search results
[Question] Do we need to set attention implementation to eager in config and in rotary setup component testing method?
complexity-highVery complicated changes for people to address who are quite familiar with the codeVery complicated changes for people to address who are quite familiar with the codequestionFurther information is requestedFurther information is requestedTransformerBridgeBug specific to the new TransformerBridge systemBug specific to the new TransformerBridge systemStatus: Open.#1440 In TransformerLensOrg/TransformerLens;[Proposal] Migrate the dev/CI Jupyter stack to Notebook 7
complexity-moderateModerately complicated issues for people who have intermediate experience with the codeModerately complicated issues for people who have intermediate experience with the codedemoCreating a demo or tutorialCreating a demo or tutorialdocumentationImprovements or additions to documentationImprovements or additions to documentationenhancementNew feature or requestNew feature or requesthelp wantedExtra attention is neededExtra attention is neededminorRelease a minor versionRelease a minor versionStatus: Open.#1438 In TransformerLensOrg/TransformerLens;[Proposal] Add HunYuan dense adapter (HunYuanDenseV1ForCausalLM)
complexity-simpleSimple issues, which may be good for beginnersSimple issues, which may be good for beginnersgood first issueGood for newcomersGood for newcomershelp wantedExtra attention is neededExtra attention is needednew-architectureThis card involves adding a new architecture .This card involves adding a new architecture .TransformerBridgeBug specific to the new TransformerBridge systemBug specific to the new TransformerBridge systemStatus: Open.#1407 In TransformerLensOrg/TransformerLens;[Proposal] Add Falcon-H1 parallel hybrid adapter (FalconH1ForCausalLM)
complexity-highVery complicated changes for people to address who are quite familiar with the codeVery complicated changes for people to address who are quite familiar with the codehelp wantedExtra attention is neededExtra attention is neededminorRelease a minor versionRelease a minor versionnew-architectureThis card involves adding a new architecture .This card involves adding a new architecture .TransformerBridgeBug specific to the new TransformerBridge systemBug specific to the new TransformerBridge systemStatus: Open.#1403 In TransformerLensOrg/TransformerLens;[Proposal] Add Nemotron-H hybrid Mamba2-Transformer adapter
complexity-highVery complicated changes for people to address who are quite familiar with the codeVery complicated changes for people to address who are quite familiar with the codehelp wantedExtra attention is neededExtra attention is neededlow-priorityMaintainers are not prioritising this work currently.Maintainers are not prioritising this work currently.new-architectureThis card involves adding a new architecture .This card involves adding a new architecture .TransformerBridgeBug specific to the new TransformerBridge systemBug specific to the new TransformerBridge systemStatus: Open.#1402 In TransformerLensOrg/TransformerLens;Use Case: Multi-GPU support for 35B SAE training & minor fix for legacy n_devices bug
questionFurther information is requestedFurther information is requestedseen_by_maintainersConfirms that a maintainer is aware of this card.Confirms that a maintainer is aware of this card.TransformerBridgeBug specific to the new TransformerBridge systemBug specific to the new TransformerBridge systemStatus: Open.#1356 In TransformerLensOrg/TransformerLens;[Bug Report] [macOS-arm64] Cached eager attention NaNs in transformers v5 — blocks bridge KV-cache generation
discussionNo action needed yetNo action needed yetseen_by_maintainersConfirms that a maintainer is aware of this card.Confirms that a maintainer is aware of this card.toolingAnything pertaining to outside tools used within the codebaseAnything pertaining to outside tools used within the codebasewontfixThis will not be worked onThis will not be worked onStatus: Open.#1322 In TransformerLensOrg/TransformerLens;[Proposal] Additional Architecture Adapter tests
complexity-simpleSimple issues, which may be good for beginnersSimple issues, which may be good for beginnersenhancementNew feature or requestNew feature or requestgood first issueGood for newcomersGood for newcomershelp wantedExtra attention is neededExtra attention is neededlow-priorityMaintainers are not prioritising this work currently.Maintainers are not prioritising this work currently.TransformerBridgeBug specific to the new TransformerBridge systemBug specific to the new TransformerBridge systemStatus: Open.[Proposal] Gemma4 Architecture Adapter
complexity-moderateModerately complicated issues for people who have intermediate experience with the codeModerately complicated issues for people who have intermediate experience with the codeenhancementNew feature or requestNew feature or requesthelp wantedExtra attention is neededExtra attention is neededTransformerBridgeBug specific to the new TransformerBridge systemBug specific to the new TransformerBridge systemStatus: Open.#1297 In TransformerLensOrg/TransformerLens;[Proposal] Add support for
cpu,meta, anddiskto TransformerBridgedevice_mapcomplexity-moderateModerately complicated issues for people who have intermediate experience with the codeModerately complicated issues for people who have intermediate experience with the codeenhancementNew feature or requestNew feature or requesthelp wantedExtra attention is neededExtra attention is neededTransformerBridgeBug specific to the new TransformerBridge systemBug specific to the new TransformerBridge systemStatus: Open.#1280 In TransformerLensOrg/TransformerLens;[Question] How do I add a custom generative video transformer into TransformerLens?
complexity-highVery complicated changes for people to address who are quite familiar with the codeVery complicated changes for people to address who are quite familiar with the codeStatus: Open.#869 In TransformerLensOrg/TransformerLens;[Question] Does TransformerLens support LVLM like Qwen2-VL?
complexity-moderateModerately complicated issues for people who have intermediate experience with the codeModerately complicated issues for people who have intermediate experience with the codemodel-requestAny issues related to requesting additional model supportAny issues related to requesting additional model supportStatus: Open.#867 In TransformerLensOrg/TransformerLens;