patrick

joined 4 months ago
MODERATOR OF
[–] [email protected] 3 points 3 days ago (1 children)

~15k lines of actual Rust code.

@  ❯ git clone https://github.com/torvalds/linux && cd linux && tokei
Cloning into 'linux'...
remote: Enumerating objects: 10655741, done.
remote: Counting objects: 100% (1067/1067), done.
remote: Compressing objects: 100% (208/208), done.
remote: Total 10655741 (delta 961), reused 859 (delta 859), pack-reused 10654674 (from 3)
Receiving objects: 100% (10655741/10655741), 5.13 GiB | 13.37 MiB/s, done.
Resolving deltas: 100% (8681589/8681589), done.
Updating files: 100% (87840/87840), done.
===============================================================================
 Language            Files        Lines         Code     Comments       Blanks
===============================================================================
 Alex                    2          222          180            0           42
 ASN.1                  15          656          441           87          128
 Assembly               10         5226         4764            0          462
 GNU Style Assembly   1336       372898       271937        56600        44361
 Autoconf                5          433          377           26           30
 Automake                3           31           23            3            5
 BASH                   59         2029         1368          352          309
 C                   34961     24854959     18510957      2766479      3577523
 C Header            25450     10090846      7834037      1503620       753189
 C++                     7         2267         1946           81          240
 C++ Header              2          125           59           55           11
 CSS                     3          295          172           69           54
 Device Tree          5582      1744314      1430810        83215       230289
 Gherkin (Cucumber)      1          333          199           97           37
 Happy                  10         6049         5332            0          717
 HEX                     2          173          173            0            0
 INI                     2           13            6            5            2
 JSON                  894       542554       542552            0            2
 LD Script               8          377          289           29           59
 Makefile             3062        81226        55970        12993        12263
 Module-Definition       2          128          113            0           15
 Objective-C             1           89           72            0           17
 Perl                   61        43843        34461         3909         5473
 Python                280        84204        66996         5198        12010
 RPM Specfile            1          131          111            2           18
 ReStructuredText     3672       761388       577410            0       183978
 Ruby                    1           29           25            0            4
 Shell                 957       187353       130476        23721        33156
 SVG                    79        52122        50727         1303           92
 SWIG                    1          252          154           27           71
 TeX                     1          234          155           73            6
 Plain Text           1455       134747            0       110453        24294
 TOML                    3           47           28           12            7
 Unreal Script           5          671          415          158           98
 Apache Velocity         1           15           15            0            0
 Vim script              1           42           33            6            3
 XSL                    10          200          122           52           26
 XML                    24        22177        19862         1349          966
 YAML                 4545       512759       417504        19285        75970
-------------------------------------------------------------------------------
 HTML                    2           28           22            3            3
 |- JavaScript           1            7            7            0            0
 (Total)                             35           29            3            3
-------------------------------------------------------------------------------
 Markdown                1          248            0          177           71
 |- BASH                 1            2            2            0            0
 |- C                    1           20           12            6            2
 (Total)                            270           14          183           73
-------------------------------------------------------------------------------
 Rust                   91        15207        11065         2248         1894
 |- Markdown            85         7773          747         5253         1773
 (Total)                          22980        11812         7501         3667
===============================================================================
 Total               82608     39520940     29971358      4591687      4957895
===============================================================================
[–] [email protected] 4 points 3 days ago

Because most people do not understand what this technology is, and attribute far too much control over the generated text to the creators. If Copilot generates the text “Trans people don’t exist”, and Microsoft doesn’t immediately address it, a huge portion of people will understand that to mean “Microsoft doesn’t think trans people exist”.

Insert whatever other politically incorrect or harmful statement you prefer.

Those sorts of problems aren’t easily fixable without manual blocks. You can train the models with a “value” system where they censor themselves but that still will be imperfect and they can still generate politically incorrect text.

IIRC some providers support 2 separate endpoints where one is raw access to the model without filtering and one is with filtering and censoring. Copilot, as a heavily branded end user product, obviously needs to be filtered.

[–] [email protected] 0 points 4 days ago (4 children)

I understand why they need to implement these blocks, but they seem to always be implemented without any way to workaround them. I hit a similar breakage using Cody (another AI assistant) which made a couple of my repositories unusable with it. https://jackson.dev/post/cody-hates-reset/

[–] [email protected] 148 points 1 week ago (25 children)

This showed up on HN recently. Several people who wrote web crawlers pointed out that this won’t even come close to working except on terribly written crawlers. Most just limit the number of pages crawled per domain based on popularity of the domain. So they’ll index all of Wikipedia but they definitely won’t crawl all 1 million pages of your unranked website expecting to find quality content.

[–] [email protected] 6 points 2 weeks ago

I chose FSL-MIT for my latest project that I plan to run as a service: https://fsl.software/

It's not technically OSS, but it is exactly what I want from a license. Users can do anything they want except make money off it themselves, but 2 years after release the software converts to MIT so you can make money off an old version of the software if you wanted. Basically I as the dev/maintainer get a 2 year lead on selling it as SaaS, and if you want to make money off of the latest versions we need to negotiate a different license agreement.

I think it's a good balance between being open source but also ensuring that development actually has a viable funding route.

[–] [email protected] 21 points 3 months ago (4 children)

This was a potential explanation as to why Bezos did that https://lemmy.haley.io/post/1058450

[–] [email protected] 5 points 3 months ago

This article is so weird. “We outperformed our competitors because Intel improved the performance of the AWS owned cryptography library we use.”

So like…what did you guys do? Don’t get me wrong, I’m not saying that you did nothing, but your career would go better if you put it in the article that is specifically for bragging about your accomplishment.

[–] [email protected] 6 points 3 months ago (1 children)

How much money do you donate to your ad-free lemmy instance? Or the rest of the free services you’re using?

For the vast majority of people, that number is $0.

[–] [email protected] 12 points 3 months ago (1 children)

Are you willing to bet the stability of an entire language's dependency ecosystem on that? Just so that we can write "crates.io" instead of "crates.rust-lang.org"?

That's really the question. I do agree that there's almost no chance it goes away as too many places and too much money depends on it.

[–] [email protected] 8 points 3 months ago

I doubt they will too, but it's still dumb that an entire package ecosystem now has to hope that ICANN will make another exception and special case .io

ICANN tried to phase out .su, the only reason they didn't was because Russia was big enough to tell them no.

[–] [email protected] 17 points 3 months ago (4 children)

Forgot to mention .sh, which is also a ccTLD for a tiny island nation, and also shouldn't be used for hosting anything that is difficult to move.

https://en.wikipedia.org/wiki/.sh

 

It's possible that the .io cctld is going to go away [0]. Does crates.io have a backup plan at all? Does anyone know what problems it would end up causing?

I imagine the package registry having to move domains is going to cause a ton of problems.

Frankly, it's concerning to me that so much of the Rust ecosystem has chosen to standardize on shaky ccTLDs. The Indian Ocean Territory (.io) is a small island territory whose only inhabitants are a single military base, it is crazy to use that domain for something important. Serbia (.rs) is more stable, but they could still cut off access for non-Serbians if they wanted to.

[0] - https://en.wikipedia.org/wiki/.io#Phasing_Out

 

After a disastrous start, Ben scrambles to recover his run.

view more: next ›