tnk2908 committed
Commit 0852a55
1 Parent(s): 9bd2dae

Add comments and update README.md

Files changed (4)
  1. README.md +6 -1
  2. analyse.py +0 -3
  3. notebooks/plot.ipynb +0 -0
  4. stegno.py +12 -1
README.md CHANGED
@@ -34,6 +34,10 @@ python api.py
 ```Bash
 python main.py -h
 ```
+- To run the analysis, see the help message by running:
+```Bash
+python analyse.py -h
+```
 ## Documentation
 - To access the documentation for the RestAPI, launch the RestAPI and go to <http://localhost:6969/docs>
 ## Configuration
@@ -55,8 +59,9 @@ python main.py -h
 - [x] Hashing schemes.
 - [x] Rest API.
 - [x] Basic Demo.
-- [ ] Statistical experiments.
+- [x] Statistical experiments.
 - [ ] Attack strategies
 - [ ] White-box
 - [ ] Black-box
 
+
analyse.py CHANGED
@@ -537,9 +537,6 @@ def main(args):
     processor.run()
     processor.plot(args.figs_dir)
 
-    # if args.figs_dir:
-    #     process_results(results, args.figs_dir)
-
 
 if __name__ == "__main__":
     args = create_args()
notebooks/plot.ipynb ADDED
The diff for this file is too large to render. See raw diff
 
stegno.py CHANGED
@@ -26,19 +26,28 @@ def generate(
     generator: torch.Generator | None = None,
 ):
     """
-    Generate the sequence containing the hidden data.
+    Generate the sequence containing the hidden data. This supports batch input/output.
 
     Args:
         tokenizer: tokenizer to use.
         model: generative model to use.
         prompt: input prompt.
         msg: message to hide in the text.
+        start_pos_p: start positions at which to hide the message.
        delta: bias add to scores of token in valid list.
         msg_base: base of the message.
         seed_scheme: scheme used to compute the seed.
         window_length: length of window to compute the seed.
         salt_key: salt to add to the seed.
         private_key: private key used to compute the seed.
+        min_new_tokens_ratio: ratio between the minimum number of generated tokens and the required token length.
+        max_new_tokens_ratio: ratio between the maximum number of generated tokens and the required token length.
+        do_sample: whether to sample or use greedy generation.
+        num_beams: number of beams used in beam search.
+        repetition_penalty: penalty applied to discourage repetition.
+        generator: torch.Generator used for generation. This is mainly used to produce deterministic results.
+    Returns:
+        generated texts, hidden-message rates, and token information.
     """
     if len(start_pos_p) == 1:
         start_pos = start_pos_p[0]
@@ -134,6 +143,8 @@ def decrypt(
         window_length: length of window to compute the seed.
         salt_key: salt to add to the seed.
         private_key: private key used to compute the seed.
+    Returns:
+        shifted versions of the message.
     """
     tokenized_input = tokenizer(text, return_tensors="pt").to(device)
 
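
Below is a minimal, hypothetical usage sketch of the `generate` API documented in the docstring above. It is based only on that docstring: the import path, the keyword-argument call style, and the concrete values (model name, prompt, message, seed-scheme string) are illustrative assumptions rather than details taken from the repository.

```python
# Hypothetical sketch based only on the docstring added in this commit.
# The import path, keyword names, and concrete values below (model name,
# prompt, message, seed scheme) are assumptions, not repository facts.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

from stegno import generate  # assumed import path

tokenizer = AutoTokenizer.from_pretrained("gpt2")     # placeholder model
model = AutoModelForCausalLM.from_pretrained("gpt2")

gen = torch.Generator().manual_seed(0)  # per the docstring, mainly for deterministic results

texts, msg_rates, tokens_info = generate(
    tokenizer=tokenizer,
    model=model,
    prompt="Once upon a time",
    msg=[1, 0, 1, 1],            # message expressed in `msg_base`
    start_pos_p=[0],             # start position(s) at which to hide the message
    delta=4.0,                   # bias added to scores of tokens in the valid list
    msg_base=2,
    seed_scheme="dummy_hash",    # placeholder; real scheme names are defined in the repo
    window_length=1,
    salt_key=0,
    private_key=0,
    min_new_tokens_ratio=1.0,
    max_new_tokens_ratio=2.0,
    do_sample=True,
    num_beams=1,
    repetition_penalty=1.0,
    generator=gen,
)
print(texts[0], msg_rates[0])    # generated text and its hidden-message rate
```

The three-way unpacking mirrors the new Returns line ("generated texts, hidden message rates, tokens information"); `decrypt` would analogously return the shifted versions of the message described in its Returns entry.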