trufflehog

mirror of https://github.com/trufflesecurity/trufflehog.git synced 2024-11-10 07:04:24 +00:00

Author	SHA1	Message	Date
Bill Rich	78d8dd3abf	Add handlerOpts back (#2258 )	2023-12-22 12:11:59 -08:00
Bill Rich	ceff786db4	Skip all binaries (#2256 ) * Skip all binaries * Remove noop * Drop handlerOpts	2023-12-22 12:01:07 -08:00
Dustin Decker	7d93adc1d0	Add skip archive support (#2257 )	2023-12-22 11:55:23 -08:00
ahrav	39f0310f1f	[fixup] - Refactor to Pass Reader for Binary Diffs and Archived Data; Optimize /tmp Directory Cleanup (#2253 )	2023-12-22 07:41:54 -08:00
Cody Rose	9c8674777c	Dedupe some source log keys (#2250 ) The source manager attaches some context keys, but in certain circumstances, they're already present, resulting in duplicate keys. This PR changes the attachment to be conditional. It also adds some new log messages to track source startup progress.	2023-12-21 10:11:52 -08:00
ahrav	07ae9ec870	Fix goroutine leak (#2251 )	2023-12-20 21:09:05 -08:00
ahrav	64c7365364	add secretID to chunk (#2242 )	2023-12-18 15:27:49 -08:00
ahrav	5c6ce693c1	[feat] - Make skipping binaries configurable (#2226 ) * Make skipping binaries configurable * remove ioutil * fix * address comments * address comments * use multi-reader * remove print * use const * fix test * fix my stupidness	2023-12-15 11:46:27 -08:00
Miccah	78b5a95342	[chore] Prevent panic when ChunkError has a nil Unit (#2227 )	2023-12-15 11:11:28 -08:00
Richard Gomez	b3040b1227	fix(github): remove unused 'members' var (#2202 )	2023-12-14 11:53:24 -08:00
Miccah	f6bbc59bf6	Check for SourceUnit support dynamically in the SourceManager (#2205 ) * Check for SourceUnit support dynamically in the SourceManager * Only call the function if we can use source units	2023-12-14 11:48:15 -08:00
Miccah	9f6a47da3f	[chore] Remove omitempty tags on JobProgressMetrics and UnitMetrics (#2204 )	2023-12-12 10:02:56 -08:00
Mike Vanbuskirk	53f060a08e	Add disk buffer tempfile cleanup (#2130 ) * add tempfile creation - break PID retrieval into sep. function * add tmpfile cleanup func * add file cleanup to main cleanup func * refactor file logic to only return name string * add temp buffer naming to gcs * add temp buffer naming to s3 * add temp buffer naming to filesystem * add temp buffer naming to git * consolidate cleanup functions - have single function handle both files and dirs - remove interface(not needed with a single func implementation) - change calls to `New(...)` to reflect config implementation - simplify automation in main.go - update disk-buffer-reader dependency * integrate changes from pr #2133 * merge main * checkout from main to revert conflict issues * re-add buffer logic to git * interface no longer needed * move string format to global const --------- Co-authored-by: Ahrav Dutta <ahrav.dutta@trufflesec.com>	2023-12-11 18:31:50 -05:00
Richard Gomez	d1a2d9e832	chore: propagate log context to handlers (#2191 )	2023-12-10 10:30:11 -08:00
ahrav	2728e514d2	move logic to main Chunks method (#2194 )	2023-12-08 14:51:24 -08:00
ahrav	2a7813929b	add metrics for gitlab (#2190 )	2023-12-08 09:50:09 -08:00
ahrav	4b31b39d6b	[chore] - Refactor common code into a separate function (#2179 ) * Refactor common code into a separate function * rename vars * make sure to set the scanOptions fields * address comments	2023-12-08 08:44:35 -08:00
ahrav	b75991850a	[chore] - Compile regex once (#2176 ) * move regex compilation out of the fxn * missed a spot * merge main	2023-12-07 07:26:27 -08:00
ahrav	0595a3baac	allow targets for the source manager (#2182 ) * allow targets to the source manager * use targets	2023-12-06 16:38:35 -08:00
ahrav	e6bc7f4451	remove unnecessary Git cmd check (#2175 )	2023-12-06 13:38:34 -08:00
ahrav	cb81f7d11a	[feat] - Remove go-git dependency (#2174 ) * remove use of go-git for binary files * fix it * use limit reader * fix comment * fix test * address comments * address comments * address comments	2023-12-06 13:38:01 -08:00
ahrav	13da76d357	skip files we can't scan (#2170 )	2023-12-04 13:37:11 -08:00
ahrav	996a11dcc0	[chore] - remove deprecated types (#2168 ) * remove deprecated types * missed one	2023-12-04 13:23:58 -08:00
ahrav	37d9e5eedf	[chore] - Increase pagination limit (#2154 ) * increae pagination limit * rename	2023-12-04 10:14:46 -08:00
ahrav	279f915799	[chore] - fix error comparisons (#2142 ) * fix error comparisons * fix imports	2023-12-01 08:32:41 -08:00
ahrav	52ffab1034	[chore] - fix import name clashes (#2143 ) * fix import name clashes * fix missing var	2023-12-01 06:53:15 -08:00
Miccah	e498c80b3d	Fix nil pointer dereference when checking if a unit IsFinished (#2135 )	2023-11-29 14:19:31 -08:00
Miccah	7ecd43ab1e	[chore] Minor cleanup of source_manager.go (#2134 )	2023-11-29 11:08:25 -08:00
Miccah	78219a27b3	Call Finish in SourceManager after the semaphore is released (#2121 )	2023-11-24 13:22:08 -08:00
Richard Gomez	024aa056b9	chore(github): add a newline between titles and bodies (#2124 )	2023-11-23 16:14:28 -08:00
Richard Gomez	1f502fd42c	feat(github): scan issue & pr titles (#1899 )	2023-11-22 19:15:27 -08:00
Dustin Decker	75e869faff	Fix forks and repos counter, add metric for orgs enumerated (#2118 )	2023-11-21 08:52:33 -08:00
Miccah	39a603d2dc	[chore] Add JSON tags to job metrics (#2114 )	2023-11-16 17:08:33 -08:00
ahrav	d334b3075e	move all Git setup into Init method (#2105 ) * add proto fields for git * add uri to proto * move all git setup into Init method * fix logic for when to use repoPath	2023-11-16 13:59:53 -08:00
Miccah	9d6bc8c504	Refactor git source to support scanning units (#2083 )	2023-11-01 09:52:58 -07:00
Miccah	52600a897a	[chore] Replace chunks channel with ChunkReporter in git based sources (#2082 ) ChunkReporter is more flexible and will allow code reuse for unit chunking. ChanReporter was added as a way to maintain the original channel functionality, so this PR should not alter existing behavior.	2023-11-01 09:22:44 -07:00
ahrav	95e0090bc2	[chore] - correctly handle input shorter than 512 bytes (#2077 ) * correctly handle input shorter than 512 bytes * add tests * reorder tests * add another test case * update test * address comment	2023-10-31 16:42:42 -07:00
Miccah	57203a56cd	[chore] Fix SourceManager flaky test (#2059 ) * [chore] Fix SourceManager flaky test Sorting by EndTime is not deterministic, however sorting by StartTime should be. StartTime is set in a goroutine that's limited by WithConcurrentUnits, so it should happen in order that the units are received. * Sort by unit ID	2023-10-30 19:16:55 -07:00
Dustin Decker	05fae156e1	Add TravisCI source (#1877 ) * Add TravisCI source * update test to use sourcestest * Remove jobPage loop ListByBuild does not support pagination, so this was infinitely repeating. https://developer.travis-ci.com/resource/jobs#find * Continue chunking on error * review updates * update readme --------- Co-authored-by: Miccah Castorina <m.castorina93@gmail.com>	2023-10-30 07:28:25 -07:00
Mike Vanbuskirk	4636dc08f6	Add temp directory management (#1878 ) * adds func to get scannerPIDs * add cleanup and call to get pids * move pid handling to git module * remove PID logic from main * refactor testing code to handle different exec name * cleanup linting errors * add better logging, fix dir if clause * some PR fixups * mod fixup * add interfaces for helper funcs * refactor cleanup into main, getPID into git * lint and test fixups, remove fail on n<2 pids * simplify pid sorting * use filepath.Join * use Args[0] for exec name, fix logger * formatting fixup * move functionality into cleantemp pkg * go mod fixup * remove redundant testing comment * fix go.sum issues * add 15m ticker loop for cleanup * enclose ticker in function for goroutine defer fix cleantemp interface * make time more readable * add check for non-local Trufflehog PIDs * allow deletion even if no non-local pids found * bundle intial cleanup into runCleanup func * add explicit regex check for tempdir format	2023-10-26 12:28:56 -04:00
Bill Rich	c5efa870ff	Use latest dbr (#1955 )	2023-10-24 07:52:49 -07:00
Miccah	0b16142d4f	Add UnitHook and NoopHook implementations (#1930 ) * Add UnitHook and NoopHook implementations The UnitHook tracks metrics per unit of a job, and emits them on a channel once finished. It should work even if the Source does not support source units. * Refactor channel to use an LRU cache instead An LRU cache has a more favorable failure mode than the channel. With the channel, if the consumer stopped consuming metrics, scanning would block. With the LRU cache, metrics will be dropped when space runs out and a log message emitted.	2023-10-23 14:27:01 -07:00
Miccah	b8724e87e6	Use the configured include repositories in the GitHub filter (#1926 )	2023-10-20 19:03:28 -07:00
Richard Gomez	3acc65b2fb	chore(github): reduce comment log verbosity (#1922 )	2023-10-20 16:16:38 -07:00
Cody Rose	7ac7fa8728	Move Github comments check to fix a test #1927	2023-10-19 19:23:55 -04:00
Richard Gomez	4b821e9732	Handle secondary GitHub ratelimits (#1912 ) * fix(github): reduce visibility-related api calls * fix(github): handle secondary ratelimits	2023-10-19 14:54:45 -04:00
Miccah	758344711a	Export ChunkError fields and add ErrorsFor convenience method (#1920 )	2023-10-19 08:46:49 -07:00
Richard Gomez	6ea3a7da4a	fix(github): normalize repo cache (#1897 )	2023-10-17 15:07:47 -07:00
Miccah	03dc7cb68d	[chore] Add SourceUnitEnumChunker filesystem tests (#1873 ) * [chore] Add SourceUnitEnumChunker filesystem tests * Ensure reported units are exactly what is expected	2023-10-16 10:42:18 -07:00
Miccah	f09bce3f75	[chore] Fix flaky TestJobProgressElapsedTime (#1872 )	2023-10-06 17:05:05 -07:00
ahrav	3d2490ca80	use Repositories field from conn. (#1860 )	2023-10-04 13:56:02 -07:00
Miccah	0d451aa806	Fix bug in chunker that surfaces with a flaky passed in io.Reader (#1838 ) * Fix bug in chunker that surfaces with a flaky passed in io.Reader The chunker was previously expecting the passed in io.Reader to always successfully read a full buffer of data, however it's valid for a Reader to return less data than requested. When this happens, the chunker would peek the same data that it then reads in the next iteration of the loop, causing the same data to be scanned twice. Co-authored-by: ahrav <ahravdutta02@gmail.com> * Fix EOF error check * Use io.ReadFull in Chunker --------- Co-authored-by: ahrav <ahravdutta02@gmail.com>	2023-10-02 09:38:23 -07:00
ahrav	c4bc8fc7fa	[bug] - correctly check err (#1824 ) * correctly check err. * address comments. * update. * add comment. * update comment.	2023-09-27 15:52:07 -07:00
Cody Rose	e9efed85c2	Use S3 credentials waterfall (#1823 ) This PR updates the S3 source to use explicitly configured credentials if they're available and follow the normal AWS credentials waterfall if they're not. This is irrespective of whether role assumption is configured. This changes the previous behavior, which was to use waterfall credentials only if role assumption was configured and explicitly configured credentials only when it was not.	2023-09-27 16:57:47 -04:00
joeleonjr	699547b7d3	consolidated pr and issue descr/comment flags (#1827 )	2023-09-27 15:54:02 -04:00
ahrav	bf47fd69bb	Github partial scan (#1804 ) * Add ability for targetted partial scans of Github. * update comment. * add more tests. * add additiional test. * address comments.	2023-09-26 12:38:33 -07:00
joeleonjr	1e42dae734	added PR and Issue body scanning (#1816 ) * added PR and Issue body scanning; adjusted CLI args to fit * removed print statement from debugging * removed exclude-commits; adjusted CLI flags * minor changes to match main branch * fixing logic * updating README for --issues and --prs	2023-09-26 12:25:48 -04:00
āh̳̕mͭͭͨͩ̐e̘ͬ́͋ͬ̊̓͂d	62b2195502	Adding new function SetProgressOngoing to be used when the source does not yet know how many items it is scanning and does not want to display a percentage complete. (#1802 ) Co-Authored-By: @mcastorina	2023-09-21 13:26:10 -04:00
Miccah	efa404942a	Add ability to dynamically scale concurrently running sources (#1790 ) * Add ability to dynamically scale concurrently running sources Refactor SourceManager to use a counting semaphore to allow for dymanically changing limits. This complicated `Wait() error` which needs to return the first error encountered. We previously got that for free using `errgroup.Group`, however now we need to handle that ourselves. `Wait()` needs to return an error for use in the engine to set the correct exit code. * Group third party imports together	2023-09-20 16:49:56 -07:00
ahrav	22876f8381	replace interface{} with any. (#1771 )	2023-09-15 04:35:15 -07:00
Miccah	dbcb888063	Update Source interface to use SourceID and JobID types (#1774 ) The previous implementation used int64 for both, which can be mixed up easily. Using distinct types adds a layer of type safety checked by the compiler.	2023-09-14 11:28:24 -07:00
Cody Rose	1155ee2736	Implement Gitlab source validation (#1765 ) This PR implements validation of Gitlab source configuration. I was hoping to be able to unify more of the implementation of Validate and Chunks, but there was more divergence than I expected. Specifically, Chunks handles a fair number of Gitlab errors that aren't configuration errors (e.g. "Gitlab returned a repo with an unparseable URL"). Accommodating these in the Validate code path felt wrong, and I wasn't able to create a common code path that could accommodate both Validate and Chunks without looking awful.	2023-09-13 11:51:12 -04:00
Miccah	72b6a9ec6b	Add a SourceType constant to all source packages (#1768 )	2023-09-12 17:23:25 -07:00
Miccah	be4d0bcb41	Refactor SourceManager to remove Enrollment (#1740 ) * Refactor SourceManager to remove Enrollment Initializing the Source will be the responsibility of the caller. The SourceManager exposes a GetIDs method for getting a source and job ID. * Update tests * Update engine usage * Update apiClient interface to have one GetIDs method * Update SourceManager usage in engine	2023-09-12 16:58:38 -07:00
Mike Vanbuskirk	de540652cb	verbosity updates to s3 source (#1750 )	2023-09-11 14:53:43 -05:00
ahrav	2a9f34962d	Add optional param to Chunks (#1747 ) * Add interface for targeted chunking. * use optional args. * update Chunks method signature. * update tests. * fix test. * update QueryCriteria type.	2023-09-07 09:03:37 -07:00
ahrav	abb131e502	[chore] - update Docker source (#1708 ) * Add concurrency and common chunker. * lint. * address comments.	2023-09-05 07:40:38 -07:00
Cody Rose	afe708519b	Validate S3 source (#1715 ) This PR adds S3 source validation. This is accomplished by factoring out common "bucket visiting" logic to be used by both scanning and validation.	2023-09-05 10:18:58 -04:00
Cody Rose	a2c0abbfd6	Unify S3 client creation logic (#1657 ) This PR unifies some code paths within the S3 source. This is being done to better support a future implementation of S3 source validation; less code that runs means less code to validate. The logical change is to move the handling of "role-less" operation down the call tree, which allows for a single code path for more of the S3 code. This PR also fixes a bug that would occur in the (rare) case that the source couldn't create a regional S3 client. Before, an error would be logged, but it would be followed by a panic. Now the bucket in question is skipped.	2023-08-30 17:49:37 -04:00
Miccah	522b2fab29	Add a cancel cause to job cancellation (#1728 )	2023-08-30 12:00:44 -07:00
Miccah	7ba880f47a	Add AvailableCapacity method to SourceManager (#1665 )	2023-08-29 12:36:44 -07:00
ahrav	2b1b1b5ad0	Add jobID to chunk. (#1721 )	2023-08-29 12:02:30 -07:00
ahrav	c51e8f8af5	buffer channel. (#1718 )	2023-08-28 18:08:31 -07:00
ahrav	0932ea224b	[chore] - Prevent nil deref panic (#1709 )	2023-08-26 20:39:50 -07:00
Miccah	5eb776cd61	Support cancelling a run from a JobProgressRef (#1663 )	2023-08-25 10:43:33 -07:00
Cody Rose	33eed42e17	Test S3 role assumption (#1655 ) This PR adds a test of the S3 role assumption functionality. It currently only tests role assumption within a single account.	2023-08-25 11:30:08 -04:00
Miccah	61977412df	Add SourceName to JobProgressRef (#1664 )	2023-08-25 07:48:25 -07:00
ahrav	4f4a79f62b	Support azure git links (#1662 ) * Support azure git links. * update comment. * update test names.	2023-08-24 14:36:52 -07:00
Miccah	f2bfcc7ac6	Capture source-reported progress in JobProgress snapshot (#1661 )	2023-08-24 11:28:50 -07:00
Miccah	a4401939a8	Add ElapsedTime method to JobProgressMetrics (#1660 )	2023-08-24 11:28:33 -07:00
ahrav	a2a7a2087e	[chore] - update comments and logs. (#1654 ) * update comments and logs. * Update github.go	2023-08-23 13:18:07 -07:00
ahrav	9ae72308be	Include the job ID in a chunk (#1652 ) * Include the job ID in a source's chunk. * address comments. * address comments.	2023-08-22 14:00:27 -07:00
Zubair Khan	fd00d2b30b	add rate limit and consumption metrics for GitHub (#1651 ) * add rate limit and consumption metrics * incrment after each repo scanned * update repo scanned label name	2023-08-22 15:01:59 -04:00
Cody Rose	059ea23a72	update s3 test bucket (#1649 ) We're switching our S3 source test account over to a different one, which means we have to change the bucket name.	2023-08-22 12:43:38 -04:00
Miccah	5cfbde783f	Fix reversed ordering of arguments (#1648 ) The source manager initialization function was defined as `sourceID` followed by `jobID`, while the source initialization function is the reverse. This is confusing and easy to mix up since the parameters are the same type. This commit adds a test to make sure the source manager initializes in the correct order, but it doesn't prevent the library user to make the same mistake. We may want to consider using different types.	2023-08-22 07:55:56 -07:00
Zubair Khan	9a13c74a35	add thog CLI support for GitHub config validate (#1626 ) * add exportable validate function for github * update validator * use the context * gate to prevent panic * wrap error with context * wrap error with context for basic auth and unauth	2023-08-22 10:22:39 -04:00
Cody Rose	dbb2c2e319	wait before finishing s3 test (#1647 ) The S3 source test verifies that chunking has completed, but it didn't actually wait for completion first, leading to non-deterministic test failures.	2023-08-21 12:36:36 -04:00
ahrav	d51e3b6d83	Only scan gist comments or repo comments. (#1646 )	2023-08-20 11:38:28 -07:00
Mike Vanbuskirk	64dd49f9ce	add role assumption for s3 source (#1477 ) * add role assumption for s3 source * refactor role assumption to repeatable string user can pass array of roles to assume * refactor s3 chunks to handle passed roleARNs * add role-session name use timestamp to make dynamic * add docstring for rolearn strings() * make sure role ars are passed into source * refactor role assumption functionality break s3 bucket scanning into sep. function * add log check on assume role * fix role iteration - Make sure s3 struct is populated with roles - add separate new client instantiation for role-based access - iterates through each role * add comment * protobuf revert for merge * re-run make proto * lint cleanup * cleanup TODOs * drop redundant switch case in assumerole client * use less verbose 'ctx' designator * breakout functionality from Chunks - separate functions for: - enumerating buckets to scan - scanning objects within the buckets * remake protobuf defs * allow scan to continue on single bucket err * add readme docs * minor fixups	2023-08-17 20:30:20 -04:00
ahrav	0ae8cf5d35	[bug] - handle IOOR panic (#1639 ) * handle IOOR panic. * use a better fxn name. * increae timeout for test to compete. * simplify code and add test. * do it for miccah.	2023-08-17 15:47:11 -07:00
ahrav	b8bb94f2b1	[bug] - copy chunk before sending on chunksChan (#1633 ) * Redclare chunk before sending on chunksChan. * add integration test. * update test.	2023-08-16 16:36:38 -07:00
Miccah	fae54c7ffa	Add ScanChunk to allow injecting Chunks into the SourceManager's channel (#1634 ) With the introduction of the SourceManager, the chunks channel became private and read-only. This provides a method to write chunks into the channel as we transition away from needing to do that.	2023-08-16 16:09:23 -07:00
Zubair Khan	db89e345d7	correct logging output for github comments and add oss flags (#1632 ) * correct logging output * add flags * respect oss cli flags for github comment scanning * improve copy	2023-08-16 18:23:59 -04:00
joeleonjr	fa9469cfc7	Docker scanning by digest (#1615 ) * added functionality to scan docker images with digests instead of tags * cleaned import statement * added unit test for baseAndTag parsing + remote digest scan	2023-08-11 16:53:12 -05:00
ahrav	e894540632	Use the common chunker for scanning the filesystem source (#1619 ) * Use the common chunker for scanning the filesystem source. * remove unused conts. * add test.	2023-08-11 13:40:10 -07:00
Bill Rich	2d2595a2e3	Move commits_scanned to ScanRepo (#1610 )	2023-08-07 14:28:57 -07:00
ahrav	13999227b9	Use common chunk reader (#1596 ) * Add common chunker. * add comment. * use better config name. * Add common chunk reader to s3. * Add common chunk reader to git, gcs, circleci. * revert gcs. * revert gcs. * fix chunker. * revert gcs. * update cancellablewrite. * revert impl. * update to remove totalsize. * Fix my goof. * Use unified struct in chunkreader. * return err instead of logging and returning. * rename error to err. * only send single ChunkResult even if there is an error and chunkBytes. * fix logic.	2023-08-07 12:55:28 -07:00
Miccah	1cd600f70f	Use SourceManager in engine (#1586 ) * Add SourceManager to Engine struct * Update Engine methods to use the SourceManager * Fix GCS test The original was testing that `Init()` errors weren't surfaced in `Finish()`, but the `SourceManager` changed that behavior. * JobProgress race fixes * Add contextual values * Remove unused code * Add debug logs * Rename WithConcurrency to WithConcurrentSources * Always forward chunks to the output chunks channel	2023-08-03 13:36:30 -05:00
Miccah	e322c4b29d	Fix nil pointer dereference to git ScanOptions (#1603 )	2023-08-03 12:07:24 -05:00
Savely Krasovsky	d062834997	initial support for bare repositories (#1499 ) * feat: initial support for bare repositories * feat: use concatenation instead of formatting and os.Getenv instead of os.Environ Signed-off-by: Savely Krasovsky <savely@krasovs.ky> * fix: go-git update with pre-receive hooks fix Signed-off-by: Savely Krasovsky <savely@krasovs.ky> * fix: remove info about pre-receive hook from README.md for now Signed-off-by: Savely Krasovsky <savely@krasovs.ky> * fix: don't scan staged while using --bare option, fixes to make it work with the latest master Signed-off-by: Savely Krasovsky <savely@krasovs.ky> * fix: small refactor according to #1518 Signed-off-by: Savely Krasovsky <savely@krasovs.ky> --------- Signed-off-by: Savely Krasovsky <savely@krasovs.ky>	2023-08-03 11:23:41 -05:00
ahrav	5a5e8a607e	Common chunk reader (#1594 ) * Add common chunker. * add comment. * use better config name. * Add common chunk reader to s3. * Add common chunk reader to git, gcs, circleci. * fix chunker. * revert gcs. * update cancellablewrite. * revert impl. * update to remove totalsize.	2023-08-03 06:27:33 -07:00
Bill Rich	c995e93dcc	Add commits scanned to log (#1600 ) * Add commits scanned to log * Use atomic	2023-08-02 14:10:54 -07:00
ahrav	78d06658ca	Dont return in loop. (#1589 )	2023-08-01 10:29:01 -07:00
Miccah	69021f59c5	Refactor git source to allow ScanOptions and use source in engine (#1518 ) * Refactor git source to allow ScanOptions and use source in engine Refactor the Chunks method of the git Source to call out to two helper methods: scanRepos and scanDirs which scans s.conn.Repositories and s.conn.Directories respectively. The only notable change in behavior is that a credential is no longer necessary if there are no s.conn.Repositories to scan. * Preserve ScanGit functionality of not cleaning up temporary files	2023-08-01 09:52:02 -05:00
ahrav	5043fc8756	[bug] - Fix unlocking an unlocked mutex (#1583 ) * use correct mutext. * remove unused fxn.	2023-07-31 14:06:41 -07:00
ahrav	eb00d0d4e1	[bug] - fix data races (#1577 ) * fix data race. * Add test and fix additional data race. * address comments.	2023-07-31 11:12:38 -07:00
Miccah	a07b6664f8	Support fatal errors in job reports (#1562 ) * Support fatal errors in job reports * WIP: JobReporter and JobInspector * WIP: JobReportHook and JobReportRef * Add ChunkError type and asyncRun helper method * Rename JobReport to JobProgress * Return a closed channel from Done when the JobProgress is nil * Comment catchFirstFatal function	2023-07-31 11:28:30 -05:00
Cody Rose	ad57de50cd	Do not nest transports for Github installation client (#1564 ) #1454 modified one of the Github enumeration code paths in a way that broke an integration test by causing one client's transport to be used for the construction of a different client, causing authentication failures. This saves the original transport for use, fixing the test.	2023-07-31 11:31:16 -04:00
Richard Gomez	e0faac8d1c	Fix runtime error when scanning Gist comments (#1552 ) * fix(github): fix runtime error from gist comments * fix(github): add flag to scan Gist comments	2023-07-31 08:57:42 -05:00
Miccah	e391e89f3e	Initial implementation of JobReport with SourceManager usage (#1557 ) * Initial implementation of JobReport with SourceManager usage * Limit concurrent units * Only save the last JobReport per handle	2023-07-27 10:49:56 -05:00
Richard Gomez	46823f77c9	feat(github): clarify comment log statement (#1553 )	2023-07-26 09:40:30 -05:00
Miccah	10f0963bc9	Add SourceManager tests for Run and Wait methods (#1530 ) * Miscellaneous SourceManager updates * Own the chunks channel instead of accepting it as an input * Add Chunks and Wait methods * Fix bug in Enroll so it actually returns the handle * Add context.Context parameter to the SourceInitFunc type * Add SourceManager tests for Run and Wait methods * Rename man variables to mgr	2023-07-26 00:48:28 -05:00
Richard Gomez	2290954b02	fix(github): use apiEndpoint for basic or no auth (#1454 )	2023-07-25 20:03:08 -07:00
Bill Rich	f39303495a	Add commitsScanned metrics (#1533 ) * Add commitsScanned metrics * Just keep commit count	2023-07-25 11:31:01 -07:00
ahrav	b5b01d3eba	[chore] - optimize chunker (#1535 ) * Use chunkbytes that includes the size of peek. * linter. * continue. * add TotalChunkSize const.	2023-07-24 19:30:29 -07:00
ahrav	9e0a2e9ddd	[chore] - Remove password info from log (#1528 ) * Remove password info from log. * update. * one more.	2023-07-22 20:25:45 -07:00
Miccah	91c5472876	Implement SourceManager basics (#1515 ) * Implement SourceManager basics * Rename identifiers and add a default headlessAPI implementation * Rewrite to use SourceInitFunc * Update variable name to accurately reflect its value	2023-07-21 15:20:25 -05:00
Miccah	4e774d1f01	Define SourceUnit chunking interface (#1484 ) * Define SourceUnit chunking interface * Refactor to use a ChunkReporter interface * Rename shadowed err to scanErr	2023-07-13 14:11:43 -05:00
Miccah	4b7f94dea1	Rewrite SourceUnitEnumerator to use UnitReporter instead of a channel (#1485 )	2023-07-13 13:48:33 -05:00
Richard Gomez	1594fddf05	feat(git): include line in github & gitlab links (#1466 )	2023-07-11 20:02:27 -07:00
Zubair Khan	4334af4d34	scan GitHub PR and issue comments (#1435 ) * issue comment scanning * save progress * test * test for pr comment and issue comment * add pagination support * linter stuff * make linter happy * remove debug log * readd logging * github issue resolved * var const block and handle rate limit * remove magic number * make gitURLParse a public function to use more generally * fix test bug * make comment scanning OPT-IN	2023-07-11 15:13:33 -04:00
Miccah	5c0ffda618	Define SourceUnit enumeration interface (#1428 ) * Add CancellableWrite helper function * Create SourceUnitEnumerator interface and EnumerationResult struct * Implement SourceUnitEnumerator for the filesystem Source * Omit explicit zero values	2023-07-10 15:05:40 -05:00
Zachary Rice	452734adc8	remove head from git diff command, rename unstaged to staged (#1439 )	2023-06-29 15:33:30 -05:00
Zachary Rice	4a77688097	use stringer again for now (#1430 )	2023-06-26 14:33:54 -05:00
trufflesteeeve	11bff81def	Use url redaction in git (#1399 ) Co-authored-by: Zachary Rice <zachary.rice@trufflesec.com>	2023-06-26 13:56:08 -05:00
Brendan Shaklovitz	da5301ea1e	Exit with non-zero exit code on chunk source error (#1286 ) * Exit with non-zero exit code on chunk source error * Exit with a non-zero exit code whenever we hit an error getting chunks. Previously the error would be logged but trufflehog would exit with a 0 (success) status code. * fix gcs test --------- Co-authored-by: Dustin Decker <dustin@trufflesec.com> Co-authored-by: ahrav <ahravdutta02@gmail.com>	2023-06-26 11:39:57 -05:00
Miccah	f3152b6885	Implement SourceUnitUnmarshaller for all sources (#1416 ) * Implement CommonSourceUnitUnmarshaller * Add SourceUnitUnmarshaller to all sources using All sources, with the exception of git, will use the CommonSourceUnit as they only contain a single type of unit to scan. * Fix method comments to adhere to Go's style guide	2023-06-23 11:15:51 -05:00
Dustin Decker	e856a6890d	🎉 Add Docker image scanning 🎉 (#1412 ) * Add Docker source * Add metrics * Add test * Add debugging, address PR comments, fix path output * review suggestions	2023-06-22 08:02:25 -07:00
dillonstreator	648ef3b52c	fix spelling errors (#1413 )	2023-06-21 07:15:28 -07:00
Miccah	e12f0f84a1	Setup SourceUnit interface (#1393 ) * Test: Asymmetrical unmarshal API * Test: Symmetric marshal API * Revert "Test: Symmetric marshal API" This reverts commit `f51c64a797`. * Cleanup test example and add SourceUnitUnmarshaller interface * Add CommonSourceUnit implementation * Update comments * Remove UnmarshalJSON	2023-06-16 10:38:28 -05:00
Bill Rich	401688d0a8	Add Validator interface and example (#1397 ) * Add Validator interface and example * Close sockets and improve error messages * Remove duplicate error * Use var declaration so err slice can be nil	2023-06-15 08:24:32 -07:00
Bill Rich	c2e3e7d53a	Split files instead of using ReadAll (#1387 ) * Split files instead of using ReadAll * Remove dup chunk * Actually break out of loop	2023-06-12 14:09:05 -07:00
Dustin Decker	572cb0e5dc	Loosen up version check for git	2023-06-01 12:17:48 -07:00
Dustin Decker	183037ab34	Check that git meets version requirements (#1373 )	2023-06-01 09:41:06 -07:00
Dustin Decker	c8944825de	Surface missing git as an error during initialization (#1362 )	2023-05-26 15:23:08 -07:00
ahrav	1da7720912	Replace context.TODO. (#1349 )	2023-05-19 11:09:51 -07:00
ahrav	31844b12e3	[oc-313] - Add GitHub metrics (#1324 ) * Normalize repos during enumeration. * fix test. * Add benchmark. * Add benchmark. * Add more realistic benchmark values. * add gist mocks. * Remove old normalize fxn. * abstract away the repo cache. * update test. * increase repo count. * increase page limnit to 100. * move callee fxns below caller for Chunks. * Add context to normalize. * remove extra logic in normalize repo. * Delete new.txt * Delete old.txt * Handle errors in a thread safe manner. * fix test.' * fix test. * handle repos that are included by users. * Abstract include ignore logic within repoCache. * Add better comment around repoCache. * Rename params. * remove commented out code. * use repos instead of items. * remove commented out code. * Use ++ instead of atomic increment. * update to use logger var. * use cache pkg. * Use separate file for repo logic. * Address comments. * fix test. * make less sucky test. * Update test. * Add logs for duration and repo size. * fix integration test. * address comment.	2023-05-16 08:45:28 -07:00
Dustin Decker	4250773e92	GitHub basic auth (#1337 )	2023-05-15 22:04:42 -07:00
ahrav	6db770fbe5	use md5 hash for checking if key exists. (#1257 )	2023-05-15 10:04:14 -07:00
ahrav	948828ba8c	[chore] - move objectManager interface (#1332 ) * Relocate the objectManager interface to the consumer package as per Go best practices. * address comment.	2023-05-15 09:30:26 -07:00
Brendan Shaklovitz	fad34d4dc6	git worktree scanning fix for #827 (#1315 ) * Fix worktree scan by setting EnableDotGitCommonDir * Change `PlainOpenOptions` to set `EnableDotGitCommonDir` to true. In every current usage of this function, it is on an already-cloned repository, so it should always be valid to have this set. By doing so, it should fix some issues with worktrees. * Remove unused go.mod replace directives * Remove replace directives for libraries that are not in use.	2023-05-09 08:00:47 -07:00
Bill Rich	f2924f3061	Make sure context lines are properly handled (#1331 ) * Make sure context lines are properly handled * Fix git test to account for context change	2023-05-05 12:51:27 -07:00
ahrav	030c093392	Fix how we scan orgs (#1327 ) * Fix how we scan orgs. * fix integration test.	2023-05-04 08:07:11 -07:00
Brendan Shaklovitz	be4147a24e	Output git timestamps as UTC times (#1323 )	2023-05-03 11:47:00 -05:00
ahrav	323c093818	Normalize GitHub repos during enumeration (#1269 ) * Normalize repos during enumeration. * fix test. * Add benchmark. * Add benchmark. * Add more realistic benchmark values. * add gist mocks. * Remove old normalize fxn. * abstract away the repo cache. * update test. * increase repo count. * increase page limnit to 100. * move callee fxns below caller for Chunks. * Add context to normalize. * remove extra logic in normalize repo. * Delete new.txt * Delete old.txt * Handle errors in a thread safe manner. * fix test.' * fix test. * handle repos that are included by users. * Abstract include ignore logic within repoCache. * Add better comment around repoCache. * Rename params. * remove commented out code. * use repos instead of items. * remove commented out code. * Use ++ instead of atomic increment. * update to use logger var. * use cache pkg. * Address comments. * fix test. * make less sucky test. * Update test.	2023-05-03 08:35:53 -07:00
ahrav	67972683ea	[chore] - format log msg (#1299 ) * format log msg. * snake. * lowercase repo.	2023-04-27 17:14:00 -07:00
ahrav	a2266b4e28	add additional logging (#1298 ) * add additional logging. * update test. * remove continue. * address comments.	2023-04-27 16:48:04 -07:00
Brendan Shaklovitz	10902f802a	Add max object size flag for s3 bucket scanning (#1294 ) Co-authored-by: Dustin Decker <dustin@trufflesec.com>	2023-04-26 15:39:43 -07:00
Dustin Decker	97ce27153a	[]bytes were being logged as b64ed string (#1255 )	2023-04-14 06:43:26 -07:00
ahrav	461f1a631e	[chore] - use hex encode vs base64 (#1256 ) * use hex encode vs base64. * fix tests.	2023-04-13 19:16:06 -07:00

1 2 3 4 5 ...

436 commits