maravento/blackweb

GitHub: maravento/blackweb

BlackWeb 是一个专为 Squid-Cache 优化的域名黑名单聚合项目,整合百余个公开来源的恶意、成人、赌博等分类域名,提供企业级网页过滤能力。

Stars: 388 | Forks: 83

# [BlackWeb](https://www.maravento.com/p/blackweb.html) [![status-maintained](https://img.shields.io/badge/status-maintained-purple.svg)](https://github.com/maravento/blackweb) [![last commit](https://img.shields.io/github/last-commit/maravento/blackweb)](https://github.com/maravento/blackweb) [![Ask DeepWiki](https://deepwiki.com/badge.svg)](https://deepwiki.com/maravento/blackweb) [![Twitter Follow](https://img.shields.io/twitter/follow/maraventostudio.svg?style=social)](https://twitter.com/maraventostudio)
English | Español
BlackWeb 是一个收集并统一公开域名黑名单(成人内容、下载、毒品、恶意软件、间谍软件、追踪器、机器人、社交网络、盗版软件、武器等)的项目,旨在使其兼容 [Squid-Cache](http://www.squid-cache.org/)。 ## 数据表 | ACL | 已拦截域名 | 文件大小 | | :---: | :---: | :---: | | blackweb.txt | 4772375 | 118,8 MB | ## GIT 克隆 ``` git clone --depth=1 https://github.com/maravento/blackweb.git ``` ## 如何使用 `blackweb.txt` 已更新并针对 [Squid-Cache](http://www.squid-cache.org/) 进行了优化。将其下载并解压到您选择的路径中,然后激活 [Squid-Cache 规则](https://github.com/maravento/blackweb#regla-squid-cache--squid-cache-rule)。 ### 下载 ``` wget -q -c -N https://raw.githubusercontent.com/maravento/blackweb/master/blackweb.tar.gz && cat blackweb.tar.gz* | tar xzf - ``` #### 如果存在多部分文件 ``` #!/bin/bash # 变量 url="https://raw.githubusercontent.com/maravento/blackweb/master/blackweb.tar.gz" wgetd="wget -q -c --timestamping --no-check-certificate --retry-connrefused --timeout=10 --tries=4 --show-progress" # TMP 文件夹 output_dir="bwtmp" mkdir -p "$output_dir" # 下载 if $wgetd "$url"; then echo "File downloaded: $(basename $url)" else echo "Main file not found. Searching for multiparts..." # Multiparts from a to z all_parts_downloaded=true for part in {a..z}{a..z}; do part_url="${url%.*}.$part" if $wgetd "$part_url"; then echo "Part downloaded: $(basename $part_url)" else echo "Part not found: $part" all_parts_downloaded=false break fi done if $all_parts_downloaded; then # Rebuild the original file in the current directory cat blackweb.tar.gz.* > blackweb.tar.gz echo "Multipart file rebuilt" else echo "Multipart process cannot be completed" exit 1 fi fi # 解压文件到输出文件夹 tar -xzf blackweb.tar.gz -C "$output_dir" echo "Done" ``` ### 校验和 ``` wget -q -c -N https://raw.githubusercontent.com/maravento/blackweb/master/blackweb.tar.gz && cat blackweb.tar.gz* | tar xzf - wget -q -c -N https://raw.githubusercontent.com/maravento/blackweb/master/blackweb.txt.sha256 LOCAL=$(sha256sum blackweb.txt | awk '{print $1}'); REMOTE=$(awk '{print $1}' blackweb.txt.sha256); echo "$LOCAL" && echo "$REMOTE" && [ "$LOCAL" = "$REMOTE" ] && echo OK || echo FAIL ``` ### BlackWeb 规则(适用于 [Squid-Cache](http://www.squid-cache.org/)) 编辑: ``` /etc/squid/squid.conf ``` 并添加以下行: ``` # 在此插入您自己的规则以允许客户端访问 # Blackweb 拦截规则 acl blackweb dstdomain "/path_to/blackweb.txt" http_access deny blackweb ``` #### 高级规则 BlackWeb 包含数百万个域名,因此建议: ##### 域名允许规则 ``` acl allowdomains dstdomain "/path_to/allowdomains.txt" http_access allow allowdomains ``` ##### 域名拦截规则 ``` acl blockdomains dstdomain "/path_to/blockdomains.txt" http_access deny blockdomains ``` ##### gTLD, sTLD, ccTLD 等拦截规则 ``` acl blocktlds dstdomain "/path_to/blocktlds.txt" http_access deny blocktlds ``` 输入: ``` .bardomain.xxx .subdomain.bardomain.xxx .bardomain.ru .bardomain.adult .foodomain.com .foodomain.porn ``` 输出: ``` .foodomain.com ``` ##### Punycode 拦截规则 ``` acl punycode dstdom_regex -i \.xn--.* http_access deny punycode ``` 输入: ``` .bücher.com .mañana.com .google.com .auth.wikimedia.org .xn--fiqz9s .xn--p1ai ``` ASCII 输出: ``` .google.com .auth.wikimedia.org ``` ##### 关键词拦截规则 ``` # 下载 ACL: sudo wget -P /etc/acl/ https://raw.githubusercontent.com/maravento/vault/refs/heads/master/blackshield/acl/squid/blockwords.txt # 拦截关键词的 Squid 规则: acl blockwords url_regex -i "/etc/acl/blockwords.txt" http_access deny blockwords ``` 输入: ``` .bittorrent.com https://www.google.com/search?q=torrent https://www.google.com/search?q=mydomain https://www.google.com/search?q=porn .mydomain.com ``` 输出: ``` https://www.google.com/search?q=mydomain .mydomain.com ``` ##### 流媒体(可选) ``` acl streaming dstdomain "/path_to/streaming.txt" http_access deny streaming ``` ``` # 拦截 Facebook .fbcdn.net .facebook.com # 拦截部分 Facebook 流媒体内容 .z-p3-video.flpb1-1.fna.fbcdn.net ``` #### 高级规则摘要 ``` # 在此插入您自己的规则以允许客户端访问 # 域名允许规则 acl allowdomains dstdomain "/path_to/allowdomains.txt" http_access allow allowdomains # Punycode 拦截规则 acl punycode dstdom_regex -i \.xn--.* http_access deny punycode # gTLD, sTLD, ccTLD 拦截规则 acl blocktlds dstdomain "/path_to/blocktlds.txt" http_access deny blocktlds # 域名拦截规则 acl blockdomains dstdomain "/path_to/blockdomains.txt" http_access deny blockdomains # Patterns 拦截规则 (可选) # https://raw.githubusercontent.com/maravento/vault/refs/heads/master/blackshield/acl/squid/blockpatterns.txt acl blockwords url_regex -i "/path_to/blockpatterns.txt" http_access deny blockwords # web3 拦截规则 (可选) # https://raw.githubusercontent.com/maravento/vault/refs/heads/master/blackshield/acl/web3/web3domains.txt acl web3 dstdomain "/path_to/web3domains.txt" http_access deny web3 # Blackweb 拦截规则 acl blackweb dstdomain "/path_to/blackweb.txt" http_access deny blackweb ``` ## BLACKWEB 更新 ### ⚠️ 警告:在继续之前 本节仅解释更新和优化过程是如何工作的。用户无需运行它。此过程可能需要时间并消耗大量硬件和带宽资源,因此建议使用测试设备。 #### Bash 更新 ``` wget -q -N https://raw.githubusercontent.com/maravento/blackweb/master/bwupdate/bwupdate.sh && chmod +x bwupdate.sh && ./bwupdate.sh ``` #### 依赖项 ``` wget git curl libnotify-bin perl tar rar unrar unzip zip gzip python-is-python3 idn2 iconv ``` ``` #!/bin/bash # 终止旧版本 while pgrep squid > /dev/null; do echo "Waiting for Squid to stop..." killall -s SIGTERM squid &>/dev/null sleep 5 done # 移除 squid (如果存在) apt purge -y squid* &>/dev/null rm -rf /var/spool/squid* /var/log/squid* /etc/squid* /dev/shm/* &>/dev/null # 安装 squid (可以使用 'squid-openssl' 或 'squid') apt install -y squid-openssl squid-langpack squid-common squidclient squid-purge # 创建日志 if [ ! -d /var/log/squid ]; then mkdir -p /var/log/squid fi &>/dev/null if [[ ! -f /var/log/squid/{access,cache,store,deny}.log ]]; then touch /var/log/squid/{access,cache,store,deny}.log fi &>/dev/null # 权限 chown -R proxy:proxy /var/log/squid # 启用服务 systemctl enable squid.service systemctl start squid.service echo "Done" ``` #### 获取公开黑名单 #### 域名调试 输入: ``` com .com .domain.com domain.com 0.0.0.0 domain.com 127.0.0.1 domain.com ::1 domain.com domain.com.co foo.bar.subdomain.domain.com .subdomain.domain.com.co www.domain.com www.foo.bar.subdomain.domain.com domain.co.uk xxx.foo.bar.subdomain.domain.co.uk ``` 输出: ``` .domain.com .domain.com.co .domain.co.uk ``` #### TLD 验证 输入: ``` .domain.exe .domain.com .domain.edu.co ``` 输出: ``` .domain.com .domain.edu.co ``` #### Punycode-IDN 调试 输入: ``` bücher.com café.fr españa.com köln-düsseldorfer-rhein-main.de mañana.com mūsųlaikas.lt sendesık.com президент.рф ``` 输出: ``` xn--bcher-kva.com xn--caf-dma.fr xn--d1abbgf6aiiy.xn--p1ai xn--espaa-rta.com xn--kln-dsseldorfer-rhein-main-cvc6o.de xn--maana-pta.com xn--mslaikas-qzb5f.lt xn--sendesk-wfb.com ``` #### 非 ASCII 字符调试 输入: ``` M-C$ -$ .$ 0$ 1$ 23andmê.com .òutlook.com .ălibăbă.com .ămăzon.com .ăvăst.com .amùazon.com .amÉ™zon.com .avalón.com .bĺnance.com .bitdẹfender.com .blóckchain.site .blockchaiǹ.com .cashpluÈ™.com .dẹll.com .diócesisdebarinas.org .disnẹylandparis.com .ebăy.com .É™mÉ™zon.com .evo-bancó.com .goglÄ™.com .gooÄŸle.com .googļę.com .googlÉ™.com .google.com .ibẹria.com .imgúr.com .lloydÅŸbank.com .mýetherwallet.com .mrgreÄ™n.com .myẹthẹrwallet.com .myẹthernwallet.com .myethẹrnwallet.com .myetheá¹™wallet.com .myethernwallẹt.com .nÄ™tflix.com .paxfùll.com .türkiyeisbankasi.com .tÅ™ezor.com .westernúnion.com .yòutube.com .yăhoo.com .yoütübe.co .yoütübe.com .yoütu.be ``` 输出: ``` .google.com ``` #### DNS 查询 ``` PROCS=$(($(nproc))) # Conservative (network-friendly) PROCS=$(($(nproc) * 2)) # Balanced PROCS=$(($(nproc) * 4)) # Aggressive (default) PROCS=$(($(nproc) * 8)) # Extreme (8 or higher, use with caution) ``` ``` nproc → 8 PROCS=$((8 * 4)) → 32 parallel queries ``` ``` Processed: 2463489 / 7244989 (34.00%) ``` 输出: ``` HIT google.com google.com has address 142.251.35.238 google.com has IPv6 address 2607:f8b0:4008:80b::200e google.com mail is handled by 10 smtp.google.com. FAULT testfaultdomain.com Host testfaultdomain.com not found: 3(NXDOMAIN) ``` #### 排除政府相关的 TLD 输入: ``` .argentina.gob.ar .mydomain.com .gob.mx .gov.uk .navy.mil ``` 输出: ``` .mydomain.com ``` #### 使用 BlackWeb 运行 Squid-Cache #### 检查执行情况 (/var/log/syslog) ``` BlackWeb: Done 06/05/2023 15:47:14 ``` #### 关于 BlackWeb 更新的重要说明 - BlackWeb 的默认路径是 `/etc/acl`。您可以根据偏好进行更改。 - 如果您需要中断 `bwupdate.sh` 的执行(ctrl + c)且它停在 [DNS 查询](https://github.com/maravento/blackweb#dns-loockup) 部分,它将从该点重新启动。如果您在此之前停止它,则必须从头开始,或者手动修改脚本以使其从所需点开始。 - 如果您使用 `aufs`,请在升级期间临时将其更改为 `ufs`,以避免:`ERROR: Can't change type of existing cache_dir aufs /var/spool/squid to ufs. Restart required`。 ## 来源 ### 黑名单 - [ABPindo - indonesianadblockrules](https://raw.githubusercontent.com/ABPindo/indonesianadblockrules/master/subscriptions/abpindo.txt) - [abuse.ch - hostfile](https://urlhaus.abuse.ch/downloads/hostfile/) - [Adaway - host](https://adaway.org/hosts.txt) - [adblockplus - advblock Russian](https://easylist-downloads.adblockplus.org/advblock.txt) - [adblockplus - antiadblockfilters](https://easylist-downloads.adblockplus.org/antiadblockfilters.txt) - [adblockplus - easylistchina](https://easylist-downloads.adblockplus.org/easylistchina.txt) - [adblockplus - easylistlithuania](https://easylist-downloads.adblockplus.org/easylistlithuania+easylist.txt) - [anudeepND - adservers](https://raw.githubusercontent.com/anudeepND/blacklist/master/adservers.txt) - [anudeepND - coinminer](https://raw.githubusercontent.com/anudeepND/blacklist/master/CoinMiner.txt) - [AssoEchap - stalkerware-indicators](https://raw.githubusercontent.com/AssoEchap/stalkerware-indicators/master/generated/hosts) - [azet12 - KADhosts](https://raw.githubusercontent.com/azet12/KADhosts/master/KADhosts.txt) - [BarbBlock - blacklists](https://paulgb.github.io/BarbBlock/blacklists/hosts-file.txt) - [BBcan177 - minerchk](https://github.com/BBcan177/minerchk) - [BBcan177 - MS-2](https://gist.github.com/BBcan177/4a8bf37c131be4803cb2) - [BBcan177 - referrer-spam-blacklist](https://github.com/BBcan177/referrer-spam-blacklist) - [betterwebleon - slovenian-list](https://raw.githubusercontent.com/betterwebleon/slovenian-list/master/filters.txt) - [bigdargon - hostsVN](https://raw.githubusercontent.com/bigdargon/hostsVN/master/hosts) - [BlackJack8 - iOSAdblockList](https://raw.githubusercontent.com/BlackJack8/iOSAdblockList/master/Hosts.txt) - [BlackJack8 - webannoyances](https://github.com/BlackJack8/webannoyances/raw/master/ultralist.txt) - [blocklistproject - everything](https://raw.githubusercontent.com/blocklistproject/Lists/master/everything.txt) - [cert.pl - List of malicious domains](https://hole.cert.pl/domains/domains.txt) - [chadmayfield - porn top](https://raw.githubusercontent.com/chadmayfield/pihole-blocklists/master/lists/pi_blocklist_porn_top1m.list) - [chadmayfield - porn_all](https://raw.githubusercontent.com/chadmayfield/my-pihole-blocklists/master/lists/pi_blocklist_porn_all.list) - [chainapsis - phishing-block-list](https://raw.githubusercontent.com/chainapsis/phishing-block-list/main/block-list.txt) - [cjx82630 - Chinese CJX's Annoyance List](https://raw.githubusercontent.com/cjx82630/cjxlist/master/cjx-annoyance.txt) - [cobaltdisco - Google-Chinese-Results-Blocklist](https://raw.githubusercontent.com/cobaltdisco/Google-Chinese-Results-Blocklist/master/GHHbD_perma_ban_list.txt) - [CriticalPathSecurity - Public-Intelligence-Feeds](https://github.com/CriticalPathSecurity/Public-Intelligence-Feeds/) - [DandelionSprout - adfilt](https://raw.githubusercontent.com/DandelionSprout/adfilt/master/Alternate%20versions%20Anti-Malware%20List/AntiMalwareHosts.txt) - [Dawsey21 - adblock-list](https://raw.githubusercontent.com/Dawsey21/Lists/master/adblock-list.txt) - [Dawsey21 - main-blacklist](https://raw.githubusercontent.com/Dawsey21/Lists/master/main-blacklist.txt) - [developerdan - ads-and-tracking-extended](https://www.github.developerdan.com/hosts/lists/ads-and-tracking-extended.txt) - [Disconnect.me - simple_ad](https://s3.amazonaws.com/lists.disconnect.me/simple_ad.txt) - [Disconnect.me - simple_malvertising](https://s3.amazonaws.com/lists.disconnect.me/simple_malvertising.txt) - [Disconnect.me - simple_tracking](https://s3.amazonaws.com/lists.disconnect.me/simple_tracking.txt) - [dorxmi - nothingblock](https://github.com/dorxmi/nothingblock) - [Eallion - uBlacklist](https://raw.githubusercontent.com/eallion/uBlacklist-subscription-compilation/refs/heads/main/uBlacklist.txt) - [EasyList - EasyListHebrew](https://raw.githubusercontent.com/easylist/EasyListHebrew/master/EasyListHebrew.txt) - [ethanr - dns-blacklists](https://bitbucket.org/ethanr/dns-blacklists/raw/master/bad_lists/Mandiant_APT1_Report_Appendix_D.txt) - [fabriziosalmi - blacklists](https://github.com/fabriziosalmi/blacklists/releases/download/latest/blacklist.txt) - [firebog - AdguardDNS](https://v.firebog.net/hosts/AdguardDNS.txt) - [firebog - Admiral](https://v.firebog.net/hosts/Admiral.txt) - [firebog - Easylist](https://v.firebog.net/hosts/Easylist.txt) - [firebog - Easyprivacy](https://v.firebog.net/hosts/Easyprivacy.txt) - [firebog - Kowabit](https://v.firebog.net/hosts/Kowabit.txt) - [firebog - neohostsbasic](https://v.firebog.net/hosts/neohostsbasic.txt) - [firebog - Prigent-Ads](https://v.firebog.net/hosts/Prigent-Ads.txt) - [firebog - Prigent-Crypto](https://v.firebog.net/hosts/Prigent-Crypto.txt) - [firebog - Prigent-Malware](https://v.firebog.net/hosts/Prigent-Malware.txt) - [firebog - RPiList-Malware](https://v.firebog.net/hosts/RPiList-Malware.txt) - [firebog - RPiList-Phishing](https://v.firebog.net/hosts/RPiList-Phishing.txt) - [firebog - WaLLy3K](https://v.firebog.net/hosts/static/w3kbl.txt) - [frogeye - firstparty-trackers-hosts](https://hostfiles.frogeye.fr/firstparty-trackers-hosts.txt) - [gardar - Icelandic ABP List](https://adblock.gardar.net/is.abp.txt) - [greatis - Anti-WebMiner](https://raw.githubusercontent.com/greatis/Anti-WebMiner/master/blacklist.txt) - [hagezi - dns-blocklists](https://raw.githubusercontent.com/hagezi/dns-blocklists/main/domains/ultimate.txt) - [hexxium - threat-list/](https://hexxiumcreations.github.io/threat-list/hexxiumthreatlist.txt) - [hoshsadiq - adblock-nocoin-list](https://raw.githubusercontent.com/hoshsadiq/adblock-nocoin-list/master/hosts.txt) - [jawz101 - potentialTrackers](https://raw.githubusercontent.com/jawz101/potentialTrackers/master/potentialTrackers.csv) - [jdlingyu - ad-wars](https://raw.githubusercontent.com/jdlingyu/ad-wars/master/hosts) - [joelotz - URL_Blacklist](https://raw.githubusercontent.com/joelotz/URL_Blacklist/master/blacklist.csv) - [kaabir - AdBlock_Hosts](https://raw.githubusercontent.com/kaabir/AdBlock_Hosts/master/hosts) - [kevle1 - Windows-Telemetry-Blocklist - xiaomiblock](https://raw.githubusercontent.com/kevle1/Xiaomi-Telemetry-Blocklist/master/xiaomiblock.txt) - [liamja - Prebake Filter Obtrusive Cookie Notices](https://raw.githubusercontent.com/liamja/Prebake/master/obtrusive.txt) - [malware-filter - URLhaus Malicious URL Blocklist](https://gitlab.com/malware-filter/urlhaus-filter/-/raw/master/urlhaus-filter.txt) - [malware-filter.- phishing-filter-hosts](https://malware-filter.gitlab.io/malware-filter/phishing-filter-hosts.txt) - [Matomo-org - spammers](https://github.com/matomo-org/referrer-spam-blacklist/blob/master/spammers.txt) - [MBThreatIntel - malspam](https://github.com/MBThreatIntel/malspam) - [mine.nu - hosts0](https://hostsfile.mine.nu/hosts0.txt) - [mitchellkrogza - Badd-Boyz-Hosts](https://raw.githubusercontent.com/mitchellkrogza/Badd-Boyz-Hosts/master/domains) - [mitchellkrogza - hacked-domains](https://github.com/mitchellkrogza/The-Big-List-of-Hacked-Malware-Web-Sites/blob/master/hacked-domains.list) - [mitchellkrogza - nginx-ultimate-bad-bot-blocker](https://raw.githubusercontent.com/mitchellkrogza/nginx-ultimate-bad-bot-blocker/master/_generator_lists/bad-referrers.list) - [mitchellkrogza - strip_domains](https://raw.githubusercontent.com/mitchellkrogza/The-Big-List-of-Hacked-Malware-Web-Sites/master/.dev-tools/_strip_domains/domains.txt) - [molinero - hBlock](https://hblock.molinero.dev/hosts_domains.txt) - [NanoAdblocker - NanoFilters](https://github.com/NanoAdblocker/NanoFilters) - [neodevpro - neodevhost](https://raw.githubusercontent.com/neodevpro/neodevhost/master/domain) - [notracking - hosts-blocklists](https://raw.githubusercontent.com/notracking/hosts-blocklists/master/hostnames.txt) - [Oleksiig - Squid-BlackList](https://raw.githubusercontent.com/oleksiig/Squid-BlackList/master/denied_ext.conf) - [openphish - feed](https://openphish.com/feed.txt) - [pengelana - domains blocklist](https://github.com/pengelana/blocklist/tree/master/src/blacklist) - [phishing.army - phishing_army_blocklist_extended](https://phishing.army/download/phishing_army_blocklist_extended.txt) - [piperun - iploggerfilter](https://github.com/piperun/iploggerfilter) - [quidsup - notrack-blocklists](https://gitlab.com/quidsup/notrack-blocklists/raw/master/notrack-blocklist.txt) - [quidsup - notrack-malware](https://gitlab.com/quidsup/notrack-blocklists/raw/master/notrack-malware.txt) - [reddestdream - MinimalHostsBlocker](https://reddestdream.github.io/Projects/MinimalHosts/etc/MinimalHostsBlocker/minimalhosts) - [RooneyMcNibNug - pihole-stuff](https://raw.githubusercontent.com/RooneyMcNibNug/pihole-stuff/master/SNAFU.txt) - [Rpsl - adblock-leadgenerator-list](https://github.com/Rpsl/adblock-leadgenerator-list) - [ruvelro - Halt-and-Block-Mining](https://raw.githubusercontent.com/ruvelro/Halt-and-Block-Mining/master/HBmining.bat) - [ryanbr - fanboy-adblock](https://github.com/ryanbr/fanboy-adblock) - [scamaNet - blocklist](https://raw.githubusercontent.com/scamaNet/blocklist/main/blocklist.txt) - [simeononsecurity/System-Wide-Windows-Ad-Blocker](https://raw.githubusercontent.com/simeononsecurity/System-Wide-Windows-Ad-Blocker/main/Files/hosts.txt) - [Someonewhocares - hosts](https://someonewhocares.org/hosts/hosts) - [stanev.org - Bulgarian adblock list](http://stanev.org/abp/adblock_bg.txt) - [StevenBlack - add.2o7Net](https://raw.githubusercontent.com/StevenBlack/hosts/master/data/add.2o7Net/hosts) - [StevenBlack - add.Risk](https://raw.githubusercontent.com/StevenBlack/hosts/master/data/add.Risk/hosts) - [StevenBlack - fakenews-gambling-porn-social](https://raw.githubusercontent.com/StevenBlack/hosts/master/alternates/fakenews-gambling-porn-social/hosts) - [StevenBlack - hosts](https://raw.githubusercontent.com/StevenBlack/hosts/master/hosts) - [StevenBlack - spam](https://raw.githubusercontent.com/StevenBlack/hosts/master/data/add.Spam/hosts) - [StevenBlack - uncheckyAds](https://raw.githubusercontent.com/StevenBlack/hosts/master/data/UncheckyAds/hosts) - [Stopforumspam - Toxic Domains](https://www.stopforumspam.com/downloads/toxic_domains_whole.txt) - [sumatipru - squid-blacklist](https://raw.githubusercontent.com/sumatipru/squid-blacklist/refs/heads/master/blacklist.txt) - [Taz - SpamDomains](http://www.taz.net.au/Mail/SpamDomains) - [tomasko126 - Easylist Czech and Slovak filter list](https://raw.githubusercontent.com/tomasko126/easylistczechandslovak/master/filters.txt) - [txthinking - blackwhite](https://raw.githubusercontent.com/txthinking/blackwhite/master/black.list) - [txthinking - bypass china domains](https://raw.githubusercontent.com/txthinking/bypass/master/china_domain.txt) - [Ultimate Hosts Blacklist - hosts](https://github.com/Ultimate-Hosts-Blacklist/Ultimate.Hosts.Blacklist/tree/master/hosts) - [Université Toulouse 1 Capitole - Blacklists UT1 - Olbat](https://github.com/olbat/ut1-blacklists/tree/master/blacklists) - [Université Toulouse 1 Capitole - Blacklists UT1](https://dsi.ut-capitole.fr/blacklists/index_en.php) - [vokins - yhosts](https://raw.githubusercontent.com/vokins/yhosts/master/hosts) - [Winhelp2002 - hosts](http://winhelp2002.mvps.org/hosts.txt) - [yourduskquibbles - Web Annoyances Ultralist](https://github.com/yourduskquibbles/webannoyances) - [yous - YousList](https://raw.githubusercontent.com/yous/YousList/master/youslist.txt) - [yoyo - Peter Lowe’s Ad and tracking server list](http://pgl.yoyo.org/adservers/serverlist.php?hostformat=nohtml) - [zoso - Adblock List](https://zoso.ro/pages/rolist.txt) ### 调试列表 - [google supported domains](https://www.google.com/supported_domains) - [iana](https://data.iana.org/TLD/tlds-alpha-by-domain.txt) - [ipv6-hosts](https://raw.githubusercontent.com/lennylxx/ipv6-hosts/master/hosts) (部分) - [publicsuffix](https://raw.githubusercontent.com/publicsuffix/list/master/public_suffix_list.dat) - [Ransomware Database](https://docs.google.com/spreadsheets/u/1/d/1TWS238xacAto-fLKh1n5uTsdijWdCEsGIM0Y0Hvmc5g/pubhtml#) - [University Domains and Names Data List](https://raw.githubusercontent.com/Hipo/university-domains-list/master/world_universities_and_domains.json) - [whoisxmlapi](https://www.whoisxmlapi.com/support/supported_gtlds.php) ### 工作工具 - [Domain Filtering](https://github.com/maravento/vault/tree/master/dofi) ## 反向链接 - [Awesome Open Source: Blackweb](https://awesomeopensource.com/project/maravento/blackweb) - [Community IPfire: url filter and self updating blacklists](https://community.ipfire.org/t/url-filter-and-self-updating-blacklists/6601) - [covert.io: Getting Started with DGA Domain Detection Research](http://www.covert.io/getting-started-with-dga-research/) - [Crazymax: WindowsSpyBlocker](https://crazymax.dev/WindowsSpyBlocker/) - [egirna: Allowing/Blocking Websites Using Squid](https://www.egirna.com/blog/news-2/allowing-blocking-websites-using-squid-5) - [Jason Trost: Getting Started with DGA Domain Detection Research](https://jason-trost.medium.com/getting-started-with-dga-domain-detection-research-89af69213257) - [Kandi Openweaver: Domains Blocklist for Squid-Cache](https://kandi.openweaver.com/shell/maravento/blackweb) - [Kerry Cordero: Blocklists of Suspected Malicious IPs and URLs](https://cordero.me/blocklists-of-suspected-malicious-ips-and-urls/) - [Keystone Solutions: blocklists](https://keystonesolutions.io/solutions/blocklists/) - [Lifars: Sites with blocklist of malicious IPs and URLs](https://lifars.com/wp-content/uploads/2017/06/LIFARS_Guide_Sites-with-blocklist-of-malicious-IPs-and-URLs.pdf) - [Opensourcelibs: Blackweb](https://opensourcelibs.com/lib/blackweb) - [OSINT Framework: Domain Name/Domain Blacklists/Blackweb](https://osintframework.com/) - [Osintbay: Blackweb](https://osintbay.com/tool/blackweb) - [Reddit: Blackweb](https://www.reddit.com/r/AskNetsec/comments/w1yqd9/does_anyone_know_any_free_database_for_url/) - [Secrepo: Samples of Security Related Data](http://www.secrepo.com/) - [Segu-Info: Análisis de malware y sitios web en tiempo real](https://blog.segu-info.com.ar/2019/07/analisis-de-malware-y-sitios-web-en.html) - [Segu-Info: Dominios/TLD dañinos que pueden ser bloqueados para evitar spam y #phishing](https://blog.segu-info.com.ar/2024/05/dominiostld-daninos-que-pueden-ser.html) - [Soficas: CiberSeguridad - Protección Activa](https://soficas.com/noticias/proteccion-ciberseguridad.html) - [Stackoverflow: Blacklist IP database](https://stackoverflow.com/a/39516166/8747573) - [Wikipedia: Blacklist_(computing)](https://en.wikipedia.org/wiki/Blacklist_(computing) - [Xploitlab: Projects using WindowsSpyBlocker](https://xploitlab.com/windowsspyblocker-block-spying-and-tracking-on-windows/) - [Zeltser: Free Blocklists of Suspected Malicious IPs and URLs](https://zeltser.com/malicious-ip-blocklists/) - [Zenarmor: How-to-enable-web-filtering-on-OPNsense-proxy?](https://www.zenarmor.com/docs/network-security-tutorials/how-to-set-up-caching-proxy-in-opnsense#how-to-enable-web-filtering-on-opnsense-proxy) ## 注意事项 - 本项目包含第三方组件。 - 变更必须通过 Issues 提出。不接受 Pull Requests。 - BlackWeb 专为 [Squid-Cache](http://www.squid-cache.org/) 设计,由于拦截的域名数量巨大,不建议在其他环境(DNSMasq, Pi-Hole 等)中使用,或将其添加到 Windows Hosts 文件中,否则可能导致变慢或崩溃。**使用风险自负**。有关更多信息,请查看 [Issue 10](https://github.com/maravento/blackweb/issues/10#issuecomment-650834301) - **Blackweb 本身不是黑名单服务**。它不独立验证域名。其目的是整合并重新格式化公开黑名单来源,以使其与 Squid 兼容。 - 如果您的域名出现在 Blackweb 中,且您认为这是一个错误,您应该查阅公开来源 [SOURCES](https://github.com/maravento/blackweb#sources),以确定它被列在哪里,并联系该列表的维护者请求移除。一旦域名从上游来源中移除,它将在下次更新时自动从 Blackweb 中消失。 您也可以使用以下脚本执行相同的验证: ``` wget https://raw.githubusercontent.com/maravento/blackweb/refs/heads/master/bwupdate/tools/checksources.sh chmod +x checksources.sh ./checksources.sh ``` 例如: ``` [?] Enter domain to search: kickass.to [*] Searching for 'kickass.to'... [+] Domain found in: https://github.com/fabriziosalmi/blacklists/releases/download/latest/blacklist.txt [+] Domain found in: https://hostsfile.org/Downloads/hosts.txt [+] Domain found in: https://raw.githubusercontent.com/blocklistproject/Lists/master/everything.txt [+] Domain found in: https://raw.githubusercontent.com/hagezi/dns-blocklists/main/domains/ultimate.txt [+] Domain found in: https://raw.githubusercontent.com/Ultimate-Hosts-Blacklist/Ultimate.Hosts.Blacklist/master/hosts/hosts0 [+] Domain found in: https://sysctl.org/cameleon/hosts [+] Domain found in: https://v.firebog.net/hosts/Kowabit.txt Done ``` ## 赞助者 [![Stargazers](https://bytecrank.com/nastyox/reporoster/php/stargazersSVG.php?user=maravento&repo=blackweb)](https://github.com/maravento/blackweb/stargazers) ## 贡献 我们感谢所有对本项目做出贡献的人。感兴趣的人可以通过发送新列表的链接来做出贡献,以将其包含在本项目中。 特别感谢:[Jhonatan Sneider](https://github.com/sney2002) ## 赞助此项目 [![Image](https://static.pigsec.cn/wp-content/uploads/repos/2026/03/163afef48a080140.png)](https://paypal.me/maravento) ## 项目许可证 [![GPL-3.0](https://img.shields.io/badge/License-GPLv3-blue.svg)](https://www.gnu.org/licenses/gpl.txt) [![CC BY-NC-ND 4.0](https://img.shields.io/badge/License-CC_BY--NC--ND_4.0-lightgrey.svg)](https://creativecommons.org/licenses/by-nc-nd/4.0/deed.en) ## 免责声明 本软件按“原样”提供,不提供任何形式的明示或暗示的保证,包括但不限于适销性、特定用途适用性和非侵权性的保证。在任何情况下,作者或版权持有人均不对任何索赔、损害或其他责任负责,无论是在合同诉讼、侵权行为还是其他方面,由本软件或本软件的使用或其他交易引起、与之相关或与之相关。 ## 异议 由于最近计算机术语的任意更改,有必要阐明与本项目相关的术语 **blacklist** 的含义和内涵: *在计算中,blacklist(黑名单)、denylist(拒绝列表)或 blocklist(拦截列表)是一种基本的访问控制机制,允许所有元素(电子邮件地址、用户、密码、URL、IP 地址、域名、文件哈希等)通过,除非明确提及。列表上的那些项目被拒绝访问。相反的是 whitelist(白名单),这意味着只有列表上的项目才能通过正在使用的任何门。* 来源 [Wikipedia](https://en.wikipedia.org/wiki/Blacklist_(computing) 因此,**blacklist**、**blocklist**、**blackweb**、**blackip**、**whitelist** 及类似术语与种族歧视无关。
标签:ACL, IP 地址批量处理, parental control, Squid-Cache, Streamlit, Web缓存, 互联网安全, 反僵尸网络, 反追踪, 域名黑名单, 威胁情报, 密码管理, 广告拦截, 应用安全, 开发者工具, 恶意软件防护, 白名单, 系统运维, 网络安全, 网络安全, 网络安全研究, 访问控制, 逆向工具, 隐私保护, 隐私保护, 黑名单