Commit graph

2495 commits

Author SHA1 Message Date
Avinash Shenoy
1442f333c2
Parallelised updating Alexa.com ranking of sites
Script now fetches Alexa ranks for sites concurrently on separate threads. Cuts down the time to sync ranks from approximately **5 minutes** to about **18 seconds**.
2019-01-27 15:01:55 +05:30
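The speed-up described in this commit comes from overlapping network waits across threads. The following is a minimal sketch of that idea, not the code from the commit: `update_ranks`, `fake_fetch_rank`, and the `max_workers` value are hypothetical names and settings chosen for illustration, and the real Alexa lookup is replaced by a stand-in so the example runs offline.

```python
from concurrent.futures import ThreadPoolExecutor


def update_ranks(sites, fetch_rank, max_workers=16):
    """Return {site_name: rank}, calling fetch_rank(url) on worker threads.

    `sites` maps a site name to its URL. `fetch_rank` is any callable that
    takes a URL and returns a rank (hypothetical; the real script would
    query a ranking service over the network here).
    """
    names = list(sites)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map runs the lookups concurrently but yields results
        # in the same order as `names`, so zip pairs them correctly.
        ranks = pool.map(lambda name: fetch_rank(sites[name]), names)
        return dict(zip(names, ranks))


def fake_fetch_rank(url):
    """Stand-in for a network lookup: derives a number from the URL."""
    return len(url)
```

Threads suit this workload because each lookup spends most of its time waiting on the network, so the waits overlap instead of adding up.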
Yahya SayadArbabi
269df6d549
Merge pull request #151 from ptalmeida/master
Fix readme and instrallpackages.sh typo
2019-01-26 16:15:05 +03:30
ptalmeida
8ee50e6717 Fix typo
necessery -> necessary
2019-01-26 11:20:36 +00:00
ptalmeida
85d7be3e77 Actually bring README.md up to date 2019-01-26 11:14:05 +00:00
Yahya SayadArbabi
d6b7c0ac55 Merge branch 'ptalmeida-Add-sorgin-by-alexa-rank-functionality' 2019-01-26 14:23:51 +03:30
ptalmeida
8b681158bc small corrections to rank sort 2019-01-25 17:36:38 +00:00
ptalmeida
78ade00dee Update outdated README.md 2019-01-25 15:10:03 +00:00
ptalmeida
5d972a3138 add --rank -r option to sherlock 2019-01-25 15:05:38 +00:00
ptalmeida
55d43b0ee6 Update requirements.txt 2019-01-25 12:50:50 +00:00
ptalmeida
db0cf7c289 Update requirements.txt 2019-01-25 12:46:05 +00:00
ptalmeida
826af1ec19 remove unused import 2019-01-25 12:45:55 +00:00
Yahya SayadArbabi
2408bb520e Merge branch 'UltraWelfare/optional_output' 2019-01-25 02:27:02 +03:30
George Tsomlektsis
0e6b8d0dca Added optional parameters for outputting files and folders. 2019-01-24 21:59:06 +02:00
George Tsomlektsis
f511faab23 Added the ability to load external json files. 2019-01-24 21:16:49 +02:00
ptalmeida
0b96141df0 Merge remote-tracking branch 'upstream/master' 2019-01-24 15:41:41 +00:00
ptalmeida
9c45146da1 remove unused import 2019-01-24 12:54:09 +00:00
ptalmeida
cc2b1cb27a Improve terminal appearance for site_list.py 2019-01-24 12:50:02 +00:00
ptalmeida
40fc51fc32 add rank parameter to site_list.py
--rank or -r to update all page ranks
2019-01-24 12:35:08 +00:00
ptalmeida
ad4c321f3f Correct print look 2019-01-24 11:16:23 +00:00
ptalmeida
04ee711872 Merge remote-tracking branch 'upstream/master' 2019-01-24 11:13:43 +00:00
ptalmeida
46705ad0a1 Add Alexa.com rank updating functionality to site_list.py 2019-01-24 11:01:34 +00:00
Christopher Kent Hoadley
eae4da7294
Merge pull request #144 from TheYahya/hoadlck-fix-more-sites
Fix EyeEm/CreativeMarket/EVE Online False Positives
2019-01-23 19:22:53 -06:00
Christopher K. Hoadley
0709fd3947 Change EyeEm/CreativeMarket/EVE Online detection method to use the newly refurbished "response_url" detection method. 2019-01-23 19:16:55 -06:00
Christopher K. Hoadley
b237b8fe02 Problems with false positives on EyeEm/CreativeMarket/EVE Online. Add these sites to the tests to capture failure. 2019-01-23 19:13:09 -06:00
Yahya SayadArbabi
eec1779a3f
Merge pull request #143 from Czechball/master
Updated asciinema
2019-01-24 01:24:09 +03:30
Czechball
4e11364508 Updated asciinema 2019-01-23 22:23:05 +01:00
Yahya SayadArbabi
cd50d34e2e Merge branch 'ptalmeida-master' 2019-01-23 23:03:14 +03:30
Yahya SayadArbabi
7a93f73b23 bump version 2019-01-23 23:03:01 +03:30
Yahya SayadArbabi
f641649be7 Sort "sites.md" 2019-01-23 23:02:35 +03:30
Pedro Almeida
1ed24f5993
correct code 2019-01-23 14:19:36 +00:00
ptalmeida
2ef168d924 Add website sorting code to site_list.py
Includes a fix in indentation
2019-01-23 14:14:37 +00:00
ptalmeida
92479c83e4 Sort websites alphabetically 2019-01-23 13:07:51 +00:00
Yahya SayadArbabi
adab51e580
Merge pull request #138 from sdushantha/master
centered the asciinema image
2019-01-23 13:11:13 +03:30
Yahya SayadArbabi
60f968d919
Merge pull request #137 from sdushantha/patch-1
replaced % with $ in README.md
2019-01-23 13:10:34 +03:30
Siddharth Dushantha
f1ac90b9ae
centered the asciinema image 2019-01-23 10:03:25 +01:00
Siddharth Dushantha
32920d7c23
replaced % with $
All of the other commands start with $, so it is better to be consistent.
2019-01-23 09:58:30 +01:00
Yahya SayadArbabi
a7fff6799d
Merge pull request #136 from sdushantha/master
added @sdushantha as the original creator
2019-01-23 09:15:01 +03:30
Siddharth Dushantha
9155d9d490
added @sdushantha as the original creator 2019-01-23 06:29:08 +01:00
Christopher K. Hoadley
a18065c1a7 Bump version. Update usage results in readme. 2019-01-22 20:55:10 -06:00
Christopher K. Hoadley
a538c07ab3 Merge branch 'pr/n133_cclauss' 2019-01-22 20:51:32 -06:00
Christopher Kent Hoadley
8dfe820067
Merge pull request #135 from TheYahya/hoadlck-response_url-site-fixes
Response URL Site Detection Strategy Overhaul
2019-01-22 20:45:46 -06:00
Christopher K. Hoadley
65b38592c4 Change "response_url" detection strategy completely.
Previously, there was a problem with sites that redirect an attempt to view a non-existing username to the main site. For example, if you try to go to https://devrant.com/users/dfoxxxxxxxxx (a user name that does not exist), then we get a redirect to the https://devrant.com/ root of the site. But, the "response_url" checking algorithm was only looking for the configured error URL being included in the response.  So, these sites always indicated that the username was not found.

Update the "response_url" detection method so that the request does not allow redirects. If we get a 200 response of some type, then the username has been found. However, if we get something like a 302, then we know that the username was not found as we are being redirected.

This whole method seems fragile, but I did exhaustively test all of the supported sites, and they all work.  So, this change is clearly an improvement.
2019-01-22 20:37:05 -06:00
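The decision rule described in this commit message can be sketched as follows. This is an illustration under stated assumptions, not Sherlock's actual code: `claimed_from_status` and `check_username` are hypothetical names, and `get` is any callable that accepts `allow_redirects` and returns an object with a `status_code` attribute.

```python
def claimed_from_status(status_code):
    """Interpret the status of a request that was made with redirects disabled."""
    # Any 2xx answer means the profile page itself was served, so the
    # username is treated as found. A 3xx answer (for example 302) means
    # the site is sending us elsewhere, which this strategy reads as
    # "username not found".
    return 200 <= status_code < 300


def check_username(url, get):
    """Fetch `url` without following redirects and classify the result.

    `get` is injected so the sketch can be exercised without network access.
    """
    response = get(url, allow_redirects=False)
    return claimed_from_status(response.status_code)
```

With the `requests` library, `requests.get` can be passed as `get`, since it accepts `allow_redirects=False` and returns a response with a `status_code` attribute.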
Christopher K. Hoadley
bb66d6a992 Update Pinterest and WordPress user URLs to exactly match what the site ends up with. If the request does not have the trailing "/", then the site will forward us to that URL. 2019-01-22 20:19:34 -06:00
Christopher K. Hoadley
ae657460c8 Add coverage tests for all sites that use the response URL detection method. This test fails because Sherlock does not handle all of these sites properly. 2019-01-22 20:16:19 -06:00
Yahya SayadArbabi
fba27cd709
Merge pull request #134 from Czechball/master
Added asciinema demo
2019-01-23 01:32:25 +03:30
Czechball
6441005784 Merge https://github.com/TheYahya/sherlock 2019-01-22 21:03:37 +01:00
cclauss
0b3c5f2ba0
Fix the top of the deerstalker
https://en.m.wikipedia.org/wiki/Deerstalker
2019-01-22 20:15:13 +01:00
cclauss
a380e48057
Make BANNER an r'string' and move it up front
Sherlock has a cool logo so this PR moves it to the top of sherlock.py to show it off when potential contributors read our source.  It also makes BANNER an __r'string'__ so we do not need backslashes to escape characters and the ascii art is more what-you-see-is-what-you-get.  This also suppresses [a bunch of flake8 complaints](https://travis-ci.com/TheYahya/sherlock/jobs/172028552#L490-L503) about _invalid escape sequences_.  Finally this PR runs isort on the imports for readability.  Standard Lib imports come first, etc.
2019-01-22 18:55:03 +01:00
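A small sketch of why a raw string helps with ASCII art. The art below is a made-up fragment for illustration, not Sherlock's banner, and `RAW_ART` and `ESCAPED_ART` are hypothetical names.

```python
# In a raw string, backslashes are kept literally, so the source text
# looks the same as what gets printed.
RAW_ART = r"|\__/|"

# The same text as a normal string needs each backslash doubled.
# Writing "|\__/|" without the r prefix would contain the invalid
# escape sequence "\_", which linters such as flake8 report.
ESCAPED_ART = "|\\__/|"

# One caveat: a raw string literal cannot end with an odd number of
# backslashes, so art lines ending in "\" need a trailing character
# or a multi-line literal.
```

Both names hold the same six characters; only the readability of the source differs.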
Christopher Kent Hoadley
cb715a6447
Add Travis CI Status Image 2019-01-22 11:44:18 -06:00
Christopher Kent Hoadley
560f7991d0
Merge pull request #130 from cclauss/patch-5
Use Travis Continuous Integration to test all code changes
2019-01-22 11:27:25 -06:00