|
1 |
| -**A friendly request: Thanks for visiting control-flag GitHub repository! If you find control-flag useful, we would appreciate a note from you (to niranjan.hasabnis@intel.com or justin.gottschlich@intel.com). And, of course, we love testimonials! |
| 1 | +**A friendly request: Thanks for visiting control-flag GitHub repository! If you find control-flag useful, we would appreciate a note from you (to niranjan.hasabnis@intel.com or justin.gottschlich@intel.com). And, of course, we love testimonials!** |
2 | 2 |
|
3 | 3 | -- The ControlFlag Team
|
4 | 4 |
|
@@ -78,20 +78,24 @@ Verilog support is WIP.
|
78 | 78 |
|
79 | 79 | #### Using patterns obtained from 6000 GitHub repos to scan repository of your choice
|
80 | 80 |
|
81 |
| -Download the training data for C language first |
82 |
| -([link](https://drive.google.com/file/d/1-jzs3zrKU541hwChaciXSk8zrnMN1mYc/view?usp=sharing)). |
| 81 | +Download the training data for C language depending on the memory constraints of your device. Note, however, that using smaller datasets may lead to lower accuracy in results. |
| 82 | + |
| 83 | +Dataset name | Size on disk | Memory requirements | Direct link | gdown ID | MD5 checksum |
| 84 | +-------------|--------------|---------------------|-------------|----------|------------- |
| 85 | +Small | ~100MB | ~400MB | [link](https://drive.google.com/file/d/1gvUyRXq1SeZD9g3i__RaamYAMo_QaQIb/view?usp=sharing) | 1gvUyRXq1SeZD9g3i__RaamYAMo_QaQIb | 2825f209aba0430993f7a21e74d99889 |
| 86 | +Medium | ~450MB | ~1.3GB | [link](https://drive.google.com/file/d/1zsCFJAKlZlSAWKPfBcVGcQNlFB5Gtwo3/view?usp=sharing) | 1zsCFJAKlZlSAWKPfBcVGcQNlFB5Gtwo3 | aab2427edebe9ed4acab75c3c6227f24 |
| 87 | +Large | ~9GB | ~13GB | [link](https://drive.google.com/file/d/1-jzs3zrKU541hwChaciXSk8zrnMN1mYc/view?usp=sharing) | 1-jzs3zrKU541hwChaciXSk8zrnMN1mYc | 1ba954d9716765d44917445d3abf8e85 |
83 | 88 |
|
84 | 89 | ```
|
85 |
| -$ python -m pip install gdown && gdown https://drive.google.com/uc?id=1-jzs3zrKU541hwChaciXSk8zrnMN1mYc |
86 |
| -$ (optional) md5sum c_lang_if_stmts_6000_gitrepos.ts.tgz |
87 |
| -1ba954d9716765d44917445d3abf8e85 |
88 |
| -$ tar -zxf c_lang_if_stmts_6000_gitrepos.ts.tgz |
| 90 | +$ python -m pip install gdown && gdown https://drive.google.com/uc?id=<id_from_table> |
| 91 | +$ (optional) md5sum <tgz_file> |
| 92 | +$ tar -zxf <tgz_file> |
89 | 93 | ```
|
90 | 94 |
|
91 | 95 | To scan C code of your choice, use below command:
|
92 | 96 |
|
93 | 97 | ```
|
94 |
| -$ scripts/scan_for_anomalies.sh -d <directory_to_be_scanned_for_anomalies> -t c_lang_if_stmts_6000_gitrepos.ts -o <output_directory_to_store_log_files> |
| 98 | +$ scripts/scan_for_anomalies.sh -d <directory_to_be_scanned_for_anomalies> -t <training_data>.ts -o <output_directory_to_store_log_files> |
95 | 99 | ```
|
96 | 100 |
|
97 | 101 | Once the run is complete (which could take some time depending on your system and the
|
|
0 commit comments