Mask Generation
sam2
hkhedr committed on
Commit
227a114
1 Parent(s): 38e0b24

Initial commit

README.md CHANGED
@@ -42,7 +42,7 @@ with torch.inference_mode(), torch.autocast("cuda", dtype=torch.bfloat16):
  ...
  ```

- Refer to the [demo notebooks](https://github.com/facebookresearch/segment-anything-2/tree/main/notebooks) for details.
+ Refer to the [demo notebooks](https://github.com/facebookresearch/segment-anything-2/tree/main/notebooks) for more details.

  ### Citation

@@ -55,4 +55,4 @@ To cite the paper, model, or software, please use the below:
  url={https://arxiv.org/abs/2408.00714},
  year={2024}
  }
- ```
+ ```
sam2_hiera_l.yaml → sam2.1_hiera_l.yaml RENAMED
@@ -93,6 +93,7 @@ model:
  use_mask_input_as_output_without_sam: true
  # Memory
  directly_add_no_mem_embed: true
+ no_obj_embed_spatial: true
  # use high-resolution feature map in the SAM mask decoder
  use_high_res_features_in_sam: true
  # output 3 masks on the first click on initial conditioning frames
@@ -101,7 +102,9 @@ model:
  iou_prediction_use_sigmoid: True
  # cross-attend to object pointers from other frames (based on SAM output tokens) in the encoder
  use_obj_ptrs_in_encoder: true
- add_tpos_enc_to_obj_ptrs: false
+ add_tpos_enc_to_obj_ptrs: true
+ proj_tpos_enc_in_obj_ptrs: true
+ use_signed_tpos_enc_to_obj_ptrs: true
  only_obj_ptrs_in_the_past_for_eval: true
  # object occlusion prediction
  pred_obj_scores: true
@@ -114,4 +117,4 @@ model:
  multimask_max_pt_num: 1
  use_mlp_for_obj_ptr_proj: true
  # Compilation flag
- compile_image_encoder: False
+ compile_image_encoder: False
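The flag changes in the config hunks above can be summarized with a small Python sketch. It models only the keys visible in this diff; the rest of the YAML file is not shown here and is not reproduced. The names `OLD_FLAGS`, `NEW_FLAGS`, and `diff_flags` are hypothetical helpers introduced for illustration and are not part of sam2.

```python
# Keys and values copied from the old side of the config hunks above.
OLD_FLAGS = {
    "use_mask_input_as_output_without_sam": True,
    "directly_add_no_mem_embed": True,
    "use_high_res_features_in_sam": True,
    "iou_prediction_use_sigmoid": True,
    "use_obj_ptrs_in_encoder": True,
    "add_tpos_enc_to_obj_ptrs": False,
    "only_obj_ptrs_in_the_past_for_eval": True,
    "pred_obj_scores": True,
    "multimask_max_pt_num": 1,
    "use_mlp_for_obj_ptr_proj": True,
    "compile_image_encoder": False,
}

# New side: one flag flipped and three flags added, as shown in the diff.
NEW_FLAGS = {
    **OLD_FLAGS,
    "no_obj_embed_spatial": True,
    "add_tpos_enc_to_obj_ptrs": True,
    "proj_tpos_enc_in_obj_ptrs": True,
    "use_signed_tpos_enc_to_obj_ptrs": True,
}


def diff_flags(old, new):
    """Return (added, changed, removed) between two flat flag mappings."""
    added = {k: new[k] for k in new if k not in old}
    changed = {k: (old[k], new[k]) for k in new if k in old and old[k] != new[k]}
    removed = sorted(k for k in old if k not in new)
    return added, changed, removed


if __name__ == "__main__":
    added, changed, removed = diff_flags(OLD_FLAGS, NEW_FLAGS)
    print("added:", added)
    print("changed:", changed)
    print("removed:", removed)
```

The final hunk shows `compile_image_encoder: False` on both sides with the same value, so the sketch reports no change for it; the diff there most likely reflects a whitespace or end-of-file difference only.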
sam2.1_hiera_large.pt CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:8b36b71d5cafc83a0975d14d0afae81c3915804e12cc896b0665eaabcc445d56
- size 898089053
+ oid sha256:2647878d5dfa5098f2f8649825738a9345572bae2d4350a2468587ece47dd318
+ size 898083611
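Because the checkpoint is tracked through a Git LFS pointer, a downloaded copy can be checked against the new `oid` and `size` values. The following is a minimal sketch using only the Python standard library; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, and the local file path in the usage note is an assumption.

```python
import hashlib
from pathlib import Path

# New-side pointer contents, copied from the diff above.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:2647878d5dfa5098f2f8649825738a9345572bae2d4350a2468587ece47dd318
size 898083611
"""


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the pointer's size and digest."""
    path = Path(path)
    # Cheap check first: a size mismatch avoids hashing a large file.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]
```

Assuming the checkpoint was saved locally as `sam2.1_hiera_large.pt`, the check would be `matches_pointer("sam2.1_hiera_large.pt", parse_lfs_pointer(NEW_POINTER))`.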